Human-centric Ontology Evaluation (HOnEst)
Human-centric Ontology Evaluation (HOnEst)
Disciplines
Computer Sciences (100%)
Keywords
-
Human Computatio,
Crowdsourcing,
Semantic Web,
Ontology,
Ontology Evaluation
In recent years, great breakthroughs have been made in the area of novel intelligent systems such as question answering systems (e.g., IBMs Watson) or conversational agents, also known as chatbots. It is naturally assumed that these systems behave in a way that is both truthful (making factually correct statements) and unbiased (acknowledging multiple, possibly contradictor viewpoints). Yet, the behaviour of these systems highly depends on the quality of world knowledge that they possess. Such world knowledge is often stored in advanced data structures known as ontologies which are typically very large and are extracted automatically from massive text collections. Unfortunately, automatic extraction algorithms are prone to deliver ontologies that contain factual errors or subscribe to a particular viewpoint. Identifying most of such shortcomings is, currently, still beyond being automated and is best performed with human involvement. The aim of HOnEst is to improve the quality of ontologies through scalable human-centric evaluation, so that they enable a factually correct and unbiased (i.e., honest) system behaviour. To that end, HOnEst will firstly refer to current literature to identify those ontology shortcomings that have been reported as requiring humans for their identification. Secondly, it will explore the use of Human Computation (HC) techniques to effectively and efficiently organize human-centric evaluation, by splitting the evaluation task into multiple smaller tasks that can be solved in parallel by a large and distributed workforce. The project will rely on the Design Science method to create HC-based solutions to the evaluation problem and then assess these in order to better understand which ontology shortcomings can actually be identified with HC. Finally, to address scalability concerns, HOnEst proposes the combined use of human and algorithmic components in the context of two real-life, very large ontologies: (1) the CSO ontology containing 15K topics about computer science research and (2) WebIsALOD which aims to increase the diversity of knowledge available to information systems with its 400M automatically extracted relations between so called tail entities (i.e., not very well-known entities that are typically not covered by other state-of-the-art large ontologies, such as those that reflect Wikipedia). In both cases, algorithmic components will determine which parts of the ontology should be evaluated by humans and improve their functionality based on this input, with the possibility of actually learning enough to perform (nearly) as well as humans do. Besides its novel scientific contributions, HOnEst has important implications on ensuring truthful and unbiased behaviour of ontology-based question answering and conversational systems. Such benefits will be already enjoyed by the systems that make use of the two ontologies improved as part of the project.
In recent years, Artificial Intelligence (AI) has undergone tremendous technological developments that have significant impact on all aspects of society from economical changes to sustainability concerns. In particular, we have witnessed the major impact of AI techniques that derive intelligence from large-amounts of data (also known as neural AI). A key representative of this trend are foundational models (frequently leveraged by conversational agents such as ChatGPT) which demonstrated the power of these technologies while highlighting their limitations in terms of lack of transparency in terms of their internal operation and weak factual correctness. As such, the key question arises: how can we leverage the tremendous power of such neural AI techniques while ensuring that AI systems remain transparent and behave truthfully by delivering factually correct information? As a solution approach, "neurosymbolic" architectures augment neural systems (such as foundational models) with factually correct knowledge captured using symbolic AI techniques. In particular, advanced data structures known as ontologies are suitable to capture verified and factually correct knowledge. Ontologies are typically automatically extracted from massive text collections and undergo a quality check that is often performed with human involvement. The aim of the HOnEst project is to improve the quality of ontologies through scalable human-centric evaluation (HCE), so that they enable a factually correct and unbiased (i.e., honest) behaviour of AI systems that rely on them. To address this goal, the project achieved the following results. First, to fully understand this topic, we systematically surveyed 15 years of work on human-centric ontology evaluation. Second, we experimentally explored solutions to ontology evaluation by splitting the evaluation task into multiple smaller tasks that can be solved in parallel by a large and distributed workforce. We proposed reusable process models and technical architectures that others can employ for their own HCE campaigns. Thirdly, we focused on making HCE scalable by combining human and algorithmic components. To that end, we investigated suitable neurosymbolic architectures that combine ontologies and neural AI systems, leading to the identification of over 40 such architecture types. Also, we proposed hybrid human-AI workflows in which humans and algorithmic elements (namely, foundational models) team up for verifying ontologies. We experimentally studied these workflows for verifying the Computer Science Knowledge Graph (CS-KG) which is an ontology-based large scale resource containing 41 Million facts about the Computer Science domain. As such, HOnEst has delivered an essential piece of the overarching puzzle of ensuring powerful and truthful AI systems thus contributing to the safe and effective uptake of these powerful technologies even in mission critical domains where factual correctness is key.
- Wirtschaftsuniversität Wien - 100%
Research Output
- 121 Citations
- 33 Publications
- 4 Policies
- 5 Datasets & models
- 1 Software
- 7 Disseminations
- 22 Scientific Awards
- 5 Fundings
-
2023
Title Combining Machine Learning and Semantic Web: A Systematic Mapping Study DOI 10.1145/3586163 Type Journal Article Author Breit A Journal ACM Computing Surveys Pages 1-41 Link Publication -
2023
Title Describing and Organizing Semantic Web and Machine Learning Systems in the SWeMLS-KG DOI 10.1007/978-3-031-33455-9_22 Type Book Chapter Author Ekaputra F Publisher Springer Nature Pages 372-389 -
2024
Title Enhancing Machine Learning Predictions Through Knowledge Graph Embeddings DOI 10.1007/978-3-031-71167-1_15 Type Book Chapter Author Llugiqi M Publisher Springer Nature Pages 279-295 -
2024
Title Deriving semantic validation rules from industrial standards: An OPC UA study DOI 10.3233/sw-233342 Type Journal Article Author Bareedu Y Journal Semantic Web Pages 517-554 Link Publication -
2024
Title Enhancing Human-in-the-Loop Ontology Curation Results through Task Design DOI 10.1145/3626960 Type Journal Article Author Tsaneva S Journal ACM Journal of Data and Information Quality Pages 1-25 Link Publication -
2024
Title Semi-automatic Knowledge Graph Creation and Evaluation Type Other Author Veronika Hiesinger Link Publication -
2024
Title Characteristics of semi-automatic knowledge graph evaluation approaches Type Other Author Oliver Liu Link Publication -
2024
Title LLM-driven Ontology Evaluation: Verifying Ontology Restrictions with ChatGPT Type Other Author Tsaneva S. Pages 15- Link Publication -
2024
Title Enhancing Scientific Knowledge Graph Generation Pipelines with LLMs and Human-in-the-Loop Type Other Author Dessì D. Pages - Link Publication -
2025
Title Validating Semantic Artifacts with Large Language Models DOI 10.1007/978-3-031-78952-6_9 Type Book Chapter Author Tufek N Publisher Springer Nature Pages 92-101 -
2025
Title Large Language Model usage when learning Ontology Engineering Type Other Author Tobias Stubreiter Link Publication -
2025
Title From Experts to LLMs: Evaluating the Quality of Automatically Generated Ontologies Type Other Author Ekaputra F.J. Pages - Link Publication -
2025
Title Benchmarking Ontology Validation Capabilities of LLMs Type Other Author Herwanto G.B. Pages - Link Publication -
2025
Title Ontology Corpora for LLM-based Knowledge Engineering Research Type Other Author Herwanto G.B. Pages - Link Publication -
2025
Title Opportunities for Knowledge Graphs in the AI landscape - An application-centric perspective DOI 10.1016/j.websem.2025.100867 Type Journal Article Author Presutti V Journal Journal of Web Semantics Pages 100867 Link Publication -
2025
Title Knowledge graph validation by integrating LLMs and human-in-the-loop DOI 10.1016/j.ipm.2025.104145 Type Journal Article Author Tsaneva S Journal Information Processing & Management Pages 104145 Link Publication -
2025
Title Semantic-Based Data Augmentation for Machine Learning Prediction Enhancement DOI 10.1177/29498732251340160 Type Journal Article Author Ekaputra F Journal Neurosymbolic Artificial Intelligence -
2025
Title Knowledge Engineering in the Age of Neurosymbolic Systems DOI 10.1177/29498732251320078 Type Journal Article Author Llugiqi M Journal Neurosymbolic Artificial Intelligence -
2020
Title Empirical Software Engineering Experimentation with Human Computation; In: Contemporary Empirical Methods in Software Engineering DOI 10.1007/978-3-030-32489-6_7 Type Book Chapter Publisher Springer International Publishing -
2020
Title DEXA: Supporting Non-Expert Annotators with Dynamic Examples from Experts DOI 10.1145/3397271.3401334 Type Conference Proceeding Abstract Author Sabou M Pages 2109-2112 Link Publication -
2023
Title ChatGPT vs Human-in-the-loop: An approach towards automated verification of ontology restrictions Type Other Author Stefan Vasic Link Publication -
2023
Title Knowledge-centric Prompt Composition for Knowledge Base Construction from Pre-trained Language Models Type Other Author A. Hughes Conference KBC-LM'23: Knowledge Base Construction from Pre-trained Language Models workshop at ISWC Link Publication -
2023
Title Using Human Computation for Ontology evaluation Type Other Author Orsa Miruna Link Publication -
2023
Title Benchmarking human-centric ontology defects Type Other Author Marlene Forman Link Publication -
2022
Title Human-Centric Ontology Evaluation: Process and Tool Support DOI 10.1007/978-3-031-17105-5_14 Type Book Chapter Author Tsaneva S Publisher Springer Nature Pages 182-197 -
2022
Title A process and tool support for human-centred ontology verification Type Other Author Klemens Käsznar Link Publication -
2022
Title Hybrid Human-Machine Evaluation of Knowledge Graphs Type Other Author Marta Sabou Conference Workshop on Human-Centered Design of Symbiotic Hybrid Intelligence, collocated with the first Int. Conf. on Hybrid Human Artificial Intelligence (HHAI) -
2021
Title Human Computation Approach for Ontology Restrictions Verification Type Conference Proceeding Abstract Author Marta Sabou Conference AAAI Conference on Human Computation and Crowdsourcing (HComp) Link Publication -
2021
Title Human-Centric Ontology Evaluation: A Human Computation approach for ontology restrictions verification Type Other Author Stefani Tsaneva Link Publication -
2021
Title Hybrid Human-Machine Ontology Verification Type Other Author Alexander Prock Link Publication -
2020
Title Effective crowd-annotation of participants, interventions, and outcomes in the text of clinical trial reports Type Other Author Sabou M. Pages 3064-3074 Link Publication -
2020
Title Verifying Extended Entity Relationship Diagrams with Open Tasks DOI 10.1609/hcomp.v8i1.7471 Type Journal Article Author Käsznar K Journal Proceedings of the AAAI Conference on Human Computation and Crowdsourcing Link Publication -
2020
Title Effective Crowd-Annotation of Participants, Interventions, and Outcomes in the Text of Clinical Trial Reports DOI 10.18653/v1/2020.findings-emnlp.274 Type Conference Proceeding Abstract Author Sabou M Pages 3064-3074
-
2025
Title Consultation on Austrian research funding policy Type Participation in a guidance/advisory committee -
2025
Title Horizon Scanning Workshop on Human-Like AI Systems (European Innovation Council ) Type Participation in a guidance/advisory committee -
2024
Title Consultation on Emerging Technology Signals - European Innovation Council Type Participation in a guidance/advisory committee -
2023
Title Austrian AI Delegation to Canada Type Participation in a guidance/advisory committee
-
0
Link
Title Ontology Corpora for LLM-based Knowledge Engineering Research Type Database/Collection of data Public Access Link Link -
2024
Link
Title Pizza Ontology Axioms Dataset Type Database/Collection of data Public Access Link Link -
2024
Link
Title Knowledge Graph Triple Validation by LLMs and Human-in-the-Loop Type Database/Collection of data Public Access Link Link -
2024
Link
Title Human-centric Evaluation of Semantic Resources: Systematic Mapping Study Protocol and Data Type Database/Collection of data Public Access Link Link -
2023
Link
Title HERO (Human-centric ontology Evaluation pROcess) - Process model DOI 10.5281/zenodo.7643356 Type Computer model/algorithm Public Access Link Link
-
2022
Link
Title Dagstuhl Seminar "Knowledge Graphs and Their Role in the Knowledge Engineering of the 21st Century " Type A formal working group, expert panel or dialogue Link Link -
2023
Title Panel on AI Ecosystems Type A formal working group, expert panel or dialogue -
2023
Link
Title Panel on "The AI Revolution" Type A formal working group, expert panel or dialogue Link Link -
2024
Link
Title Project website Type Engagement focused website, blog or social media channel Link Link -
2023
Link
Title WU Magazin Type A magazine, newsletter or online publication Link Link -
2022
Link
Title Inaugural Lecture: "Humans - The Future of Artificial Intelligence?" Type A talk or presentation Link Link -
2023
Link
Title Panel "Women in Tech Summit" Warsaw Type A talk or presentation Link Link
-
2025
Title Research visit Lisa Ehrlinger Type Attracted visiting staff or user to your research group Level of Recognition National (any country) -
2025
Title Keynote Speaker at the IEEE International Conference on Data and Software Engineering (ICoDSE) Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2025
Title Keynote Speaket at the BILAI Kick-off Event Type Personally asked as a key note speaker to a conference Level of Recognition National (any country) -
2025
Title Co-Editor of Special Issue on "Knowledge Graphs in the AI landscape - An application-centric perspective" Type Appointed as the editor/advisor to a journal or book series Level of Recognition Continental/International -
2025
Title Research visit Gianluca Demartini Type Attracted visiting staff or user to your research group Level of Recognition Continental/International -
2025
Title Research visit Nilay Tüfek Özkaya Type Attracted visiting staff or user to your research group Level of Recognition National (any country) -
2024
Title Strategic Advisory Board Member: Digital Humanism Association Type Prestigious/honorary/advisory position to an external body Level of Recognition National (any country) -
2024
Title Invited Talk at University of Mannheim/Engage.EU Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2024
Title Co-editor of Special Issue on Knowledge Graphs and Neurosymbolic AI Type Appointed as the editor/advisor to a journal or book series Level of Recognition Continental/International -
2024
Title Advisory Board Member: Josef Ressel Center Industrial Data Lab Type Prestigious/honorary/advisory position to an external body Level of Recognition National (any country) -
2024
Title Advisory Board Member: INTEND Project (EU) Type Prestigious/honorary/advisory position to an external body Level of Recognition Continental/International -
2024
Title Research visit Raghava Mutharaju Type Attracted visiting staff or user to your research group Level of Recognition Continental/International -
2024
Title Keynote Speaker "Workshop on Semantic Technologies and Deep Learning Models for Scientific, Technical and Legal Data" Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2024
Title Advisory Board Member: Austrian Lab for AI Trust (ALAIT) Project Type Prestigious/honorary/advisory position to an external body Level of Recognition National (any country) -
2023
Title Research visit Paul Groth Type Attracted visiting staff or user to your research group Level of Recognition Continental/International -
2023
Title Board Member: Austrian Society for Artificial Intelligence Type Prestigious/honorary/advisory position to an external body Level of Recognition National (any country) -
2023
Title Editorial Board Member - Neurosymbolic Artificial Intelligence Journal Type Appointed as the editor/advisor to a journal or book series Level of Recognition Continental/International -
2023
Title Research visit MarÃa Poveda-Villalón Type Attracted visiting staff or user to your research group Level of Recognition Continental/International -
2022
Title Research visit Heiko Paulheim Type Attracted visiting staff or user to your research group Level of Recognition Continental/International -
2022
Title Research visit Frank van Harmelen Type Attracted visiting staff or user to your research group Level of Recognition Continental/International -
2021
Title Invited Speaker at "Austrian Computer Science Days" Type Personally asked as a key note speaker to a conference Level of Recognition National (any country) -
2021
Title Invited Talk at University of Mannheim Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International
-
2021
Title ORE: OPC UA Rule Editor Type Research grant (including intramural programme) Start of Funding 2021 Funder Siemens AG -
2025
Title Visiting Student Grant for Ms. Stefani Tsaneva Type Travel/small personal Start of Funding 2025 Funder ARC Industrial Transformation Training Centre for Information Resilience (CIRES) -
2024
Title Doctoral College on Digital Humanism Type Research grant (including intramural programme) Start of Funding 2024 Funder Vienna Science and Technology Fund -
2023
Title PERKS: Eliciting and Exploiting Procedural Knowledge in Industry 5.0 Type Research grant (including intramural programme) Start of Funding 2023 Funder European Commission -
2025
Title WU International Research Fellow (WU IRF) - Ms. Stefani Tsaneva Type Travel/small personal Start of Funding 2025 Funder Vienna University of Economics and Business