Interactive Spreadsheet Debugging
Interactive Spreadsheet Debugging
Disciplines
Computer Sciences (100%)
Keywords
-
Testing,
Spreadsheets,
End User Programming,
Debugging
Electronic spreadsheets, e.g., based on Microsoft Excel, are widely used in organizations for a variety of purposes like budgeting, forecasting, and financial planning. In many cases, the calculations within such spreadsheets are directly used for managerial decision-making. Errors in spreadsheets can therefore represent a major risk to businesses and there are plenty of examples where such errors have led to significant financial losses. Unfortunately, errors in spreadsheets are not uncommon, and studies show that the chance of a complex spreadsheet to contain at least one error is quite high. Several countermeasures can be applied to minimize the risk that errors remain in a spreadsheet before it is used. One specific approach is to provide the spreadsheet developer with better tools to test the spreadsheet for correctness and to find the specific location of those formulas that at the end lead to a wrong calculation outcome. Specifically for the latter task, the identification of faulty formulas, a number of algorithms were proposed that have the goal to determine a ranked list of suspicious formulas. Recent research in related fields however stipulates that presenting such a ranked list as the only debugging aid might not be enough to be truly useful for the spreadsheet developer. In this project, we will therefore explore new approaches for interactive spreadsheet debugging, building on the results of our previous research on algorithmic approaches to fault localization. Specifically, we will investigate approaches where the user is interactively guided by the system to the true fault location in an iterative process. Furthermore, we will explore explanation mechanisms, where users can ask why and why not certain calculation outcomes are observed. To achieve these goals and real-time interaction, we will additionally explore improvements to the underlying algorithms. In line with current streams of research in general software engineering, we will empirically validate our research approaches not only based on simulation experiments, but with the help of different user studies. As an additional result, we will obtain new insights about the general behavior and strategies of users when they design, test, and debug their spreadsheets.
Electronic spreadsheets, e.g., based on Microsoft Excel, are widely used in organizations for a variety of purposes like budgeting, forecasting, and financial planning. In many cases, the calculations within such spreadsheets are directly used for managerial decision-making. Errors in spreadsheets can therefore represent a major risk to businesses and there are plenty of examples where such errors have led to significant financial losses. Unfortunately, errors in spreadsheets are not uncommon, and studies show that the chance of a complex spreadsheet to contain at least one error is quite high. Several countermeasures can be applied to minimize the risk that errors remain in a spreadsheet before it is used. One specific approach is to provide the spreadsheet developer with better tools to test the spreadsheet for correctness and to find the specific location of those formulas that at the end lead to a wrong calculation outcome. Specifically for the latter task, the identification of faulty formulas, a number of algorithms were proposed that have the goal to determine a ranked list of "suspicious" formulas. Recent research in related fields however stipulates that presenting such a ranked list as the only debugging aid might not be enough to be truly useful for the spreadsheet developer. In this project, new approaches to interactive debugging of spreadsheets were investigated. Specifically, algorithms were developed in which the user is guided by the system in an iterative process to the true cause of the error in an interactive way. Additionally, explanation mechanisms were designed that allow the user to understand why a formula was marked by the system as potentially erroneous. In the field of algorithmic research, approaches from classical artificial intelligence as well as new methods based on machine learning were designed or further developed. The research aimed at increasing the accuracy of error prediction methods and reducing the required computational effort. The research results achieved were successfully published in reputable scientific outlets in both the field of artificial intelligence and software engineering. As purely data- or simulation-based approaches only provide partial insights into the utility of a debugging tool, two important user studies were conducted within the project. These studies yielded valuable insights into how users interact with such tools. It was particularly noticeable that some users overly relied on the tool and focused too much on the errors it highlighted as potentially faulty. The second study revealed that providing explanations to users during error analysis can be helpful. The results of these studies were published in two articles in a reputable journal in the field of software engineering.
- Technische Universität Graz - 40%
- Universität Klagenfurt - 60%
- Franz Wotawa, Technische Universität Graz , associated research partner
- Rui Abreu, University of Lisbon - Portugal
Research Output
- 43 Citations
- 37 Publications
- 3 Datasets & models
- 1 Scientific Awards
-
2024
Title Sequential Model-Based Diagnosis by Systematic Search (Abstract Reprint) DOI 10.1609/aaai.v38i20.30609 Type Journal Article Author Rodler P Journal Proceedings of the AAAI Conference on Artificial Intelligence -
2024
Title Investigating Reproducibility in Deep Learning-Based Software Fault Prediction DOI 10.48550/arxiv.2402.05645 Type Preprint Author Jannach D Link Publication -
2023
Title Explaining software fault predictions to spreadsheet users DOI 10.1016/j.jss.2023.111676 Type Journal Article Author Hofer B Journal Journal of Systems and Software -
2024
Title Investigating Reproducibility in Deep Learning-Based Software Fault Prediction DOI 10.1109/qrs62785.2024.00038 Type Conference Proceeding Abstract Author Jannach D Pages 306-317 -
2025
Title Choosing abstraction levels for model-based software debugging: A theoretical and empirical analysis for spreadsheet programs DOI 10.1016/j.artint.2025.104399 Type Journal Article Author Hofer B Journal Artificial Intelligence -
2023
Title DynamicHS: Streamlining Reiter's Hitting-Set Tree for Sequential Diagnosis DOI 10.1016/j.ins.2022.08.029 Type Journal Article Author Rodler P Journal Information Sciences -
2023
Title Memory-Limited Model-Based Diagnosis (Extended Abstract) DOI 10.24963/ijcai.2023/789 Type Conference Proceeding Abstract Author Rodler P Pages 6954-6958 -
2023
Title Sequential model-based diagnosis by systematic search DOI 10.1016/j.artint.2023.103988 Type Journal Article Author Rodler P Journal Artificial Intelligence -
2023
Title How Should I Compute My Candidates? A Taxonomy and Classification of Diagnosis Computation Algorithms; In: ECAI 2023 - 26th European Conference on Artificial Intelligence, September 30-October 4, 2023, Krakw, Poland - Including 12th Conference on Prestigious Applications of Intelligent Systems (PAIS 2023) DOI 10.3233/faia230490 Type Book Chapter Publisher IOS Press -
2023
Title Advancing Spreadsheet Quality Assurance: A Novel Fault Localization Approach, User-Centric Evaluations of Explainable Faults, and Tool Over-reliance Type Other Author Mukhtar A -
2023
Title Don't Treat the Symptom, Find the Cause! Efficient Artificial-Intelligence Methods for (Interactive) Debugging Type Other Author Rodler P Link Publication -
2022
Title Boosting Spectrum-Based Fault Localization for Spreadsheets with Product Metrics in a Learning Approach DOI 10.5281/zenodo.6826794 Type Conference Proceeding Abstract Author Hofer B Link Publication -
2022
Title Boosting Spectrum-Based Fault Localization for Spreadsheets with Product Metrics in a Learning Approach DOI 10.5281/zenodo.6826795 Type Conference Proceeding Abstract Author Hofer B Link Publication -
2022
Title Boosting Spectrum-Based Fault Localization for Spreadsheets with Product Metrics in a Learning Approach DOI 10.1145/3551349.3559546 Type Conference Proceeding Abstract Author Mukhtar A Pages 1-5 Link Publication -
2021
Title Randomized Problem-Relaxation Solving for Over-Constrained Schedules DOI 10.24963/kr.2021/72 Type Conference Proceeding Abstract Author Rodler P Pages 696-701 -
2022
Title Random vs. Best-First: Impact of Sampling Strategies on Decision Making in Model-Based Diagnosis DOI 10.1609/aaai.v36i5.20531 Type Journal Article Author Rodler P Journal Proceedings of the AAAI Conference on Artificial Intelligence Pages 5869-5878 Link Publication -
2021
Title Linear-Space Best-First Diagnosis Search DOI 10.1609/socs.v12i1.18579 Type Journal Article Author Rodler P Journal Proceedings of the International Symposium on Combinatorial Search Pages 188-190 Link Publication -
2021
Title Too Good to Throw Away: A Powerful Reuse Strategy for Reiter's Hitting Set Tree DOI 10.1609/socs.v11i1.18527 Type Journal Article Author Rodler P Journal Proceedings of the International Symposium on Combinatorial Search Pages 135-136 Link Publication -
2021
Title Appendix to the Paper: DynamicHS: Streamlining Reiter's Hitting Set Tree for Sequential Diagnosis Type Other Author Rodler P Link Publication -
2022
Title Memory-limited model-based diagnosis DOI 10.1016/j.artint.2022.103681 Type Journal Article Author Rodler P Journal Artificial Intelligence Pages 103681 Link Publication -
2022
Title Spreadsheet debugging: The perils of tool over-reliance DOI 10.1016/j.jss.2021.111119 Type Journal Article Author Mukhtar A Journal Journal of Systems and Software Pages 111119 Link Publication -
2022
Title One step at a time: An efficient approach to query-based ontology debugging DOI 10.1016/j.knosys.2022.108987 Type Journal Article Author Rodler P Journal Knowledge-Based Systems Pages 108987 Link Publication -
2022
Title A formal proof and simple explanation of the QuickXplain algorithm DOI 10.1007/s10462-022-10149-w Type Journal Article Author Rodler P Journal Artificial Intelligence Review Pages 6185-6206 Link Publication -
2021
Title Comprehending Spreadsheets: Which Strategies do Users Apply? DOI 10.1109/icpc52881.2021.00044 Type Conference Proceeding Abstract Author Hodnigg K Pages 386-390 -
2021
Title Product metrics for spreadsheets—A systematic review DOI 10.1016/j.jss.2021.110910 Type Journal Article Author Hofer B Journal Journal of Systems and Software Pages 110910 Link Publication -
2022
Title How should I compute my candidates? A taxonomy and classification of diagnosis computation algorithms Type Conference Proceeding Abstract Author Rodler P Conference 33rd International Workshop on Principle of Diagnosis - DX 2022 Link Publication -
2022
Title Appendix to the Paper: Sequential Model-Based Diagnosis by Systematic Search Type Other Author Rodler P Link Publication -
2022
Title RBF-HS: Recursive best-first hitting set search Type Other Author Rodler P Link Publication -
2021
Title AI-based Spreadsheet Debugging DOI 10.1142/9789811239922_0013 Type Book Chapter Author Schekotihin K Publisher World Scientific Publishing Pages 371-399 -
2020
Title Understanding the QuickXPlain algorithm: Simple explanation and formal proof Type Other Author Rodler P Link Publication -
2020
Title On Expert Behaviors and Question Types for Efficient Query-Based Ontology Fault Localization Type Other Author Rodler P Link Publication -
2020
Title DynamicHS: Optimizing Reiter's HS-Tree for Sequential Diagnosis Type Conference Proceeding Abstract Author Rodler P Conference 31st International Workshop on Principles of Diagnosis: DX-2020 Link Publication -
2020
Title Do we really sample right in model-based diagnosis? Type Conference Proceeding Abstract Author Elichanova F Conference 31st International Workshop on Principles of Diagnosis: DX-2020 Link Publication -
2020
Title The Scheduling Job-Set Optimization Problem: A Model-Based Diagnosis Approach Type Conference Proceeding Abstract Author Rodler P Conference 31st International Workshop on Principles of Diagnosis: DX-2020 Link Publication -
2020
Title Sound, Complete, Linear-Space, Best-First Diagnosis Search Type Conference Proceeding Abstract Author Rodler P Conference 31st International Workshop on Principles of Diagnosis: DX-2020 Link Publication -
2020
Title Reuse, Reduce and Recycle: Optimizing Reiter's HS-Tree for Sequential Diagnosis Type Conference Proceeding Abstract Author Rodler P Conference ECAI 2020 Pages 873-880 Link Publication -
0
DOI 10.1145/3551349 Type Other
-
2023
Link
Title Explaining Software Fault Predictions to Spreadsheet Users DOI 10.5281/zenodo.7676963 Type Database/Collection of data Public Access Link Link -
2022
Link
Title Learning- and Spectrum-Based Fault Localization for Spreadsheets DOI 10.5281/zenodo.6826795 Type Computer model/algorithm Public Access Link Link -
2021
Link
Title Spreadsheet Debugging: The Perils of Tool Over-reliance DOI 10.5281/zenodo.5533461 Type Database/Collection of data Public Access Link Link
-
2023
Title Recent advances in debugging spreadsheets Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International