Project detail

Grant DOI 10.55776/V745
Funding program Elise Richter
Status ended
Start April 1, 2020
End March 31, 2025
Funding amount € 361,358
Project website

Disciplines

Computer Sciences (100%)

Keywords

Human Computation,
Crowdsourcing,
Semantic Web,
Ontology,
Ontology Evaluation

Abstract

In recent years, great breakthroughs have been made in the area of novel intelligent systems such as question answering systems (e.g., IBM's Watson) or conversational agents, also known as chatbots. It is naturally assumed that these systems behave in a way that is both truthful (making factually correct statements) and unbiased (acknowledging multiple, possibly contradictory viewpoints). Yet the behaviour of these systems depends heavily on the quality of the world knowledge that they possess. Such world knowledge is often stored in advanced data structures known as ontologies, which are typically very large and are extracted automatically from massive text collections. Unfortunately, automatic extraction algorithms are prone to deliver ontologies that contain factual errors or subscribe to a particular viewpoint. Identifying most such shortcomings cannot yet be automated and is best performed with human involvement. The aim of HOnEst is to improve the quality of ontologies through scalable human-centric evaluation, so that they enable factually correct and unbiased (i.e., honest) system behaviour. To that end, HOnEst will first review the current literature to identify those ontology shortcomings that have been reported as requiring humans for their identification. Second, it will explore the use of Human Computation (HC) techniques to organize human-centric evaluation effectively and efficiently, by splitting the evaluation task into multiple smaller tasks that can be solved in parallel by a large and distributed workforce. The project will rely on the Design Science method to create HC-based solutions to the evaluation problem and then assess these in order to better understand which ontology shortcomings can actually be identified with HC.
Finally, to address scalability concerns, HOnEst proposes the combined use of human and algorithmic components in the context of two real-life, very large ontologies: (1) the CSO ontology, containing 15K topics about computer science research, and (2) WebIsALOD, which aims to increase the diversity of knowledge available to information systems with its 400M automatically extracted relations between so-called tail entities (i.e., not very well-known entities that are typically not covered by other state-of-the-art large ontologies, such as those that reflect Wikipedia). In both cases, algorithmic components will determine which parts of the ontology should be evaluated by humans and will improve their functionality based on this input, with the possibility of learning enough to perform (nearly) as well as humans do. Besides its novel scientific contributions, HOnEst has important implications for ensuring truthful and unbiased behaviour of ontology-based question answering and conversational systems. Such benefits will already be enjoyed by the systems that make use of the two ontologies improved as part of the project.

Final report

In recent years, Artificial Intelligence (AI) has undergone tremendous technological developments that have a significant impact on all aspects of society, from economic changes to sustainability concerns. In particular, we have witnessed the major impact of AI techniques that derive intelligence from large amounts of data (also known as neural AI). Key representatives of this trend are foundational models (frequently leveraged by conversational agents such as ChatGPT), which have demonstrated the power of these technologies while highlighting their limitations: a lack of transparency about their internal operation and weak factual correctness. Thus, a key question arises: how can we leverage the tremendous power of such neural AI techniques while ensuring that AI systems remain transparent and behave truthfully by delivering factually correct information? As a solution approach, "neurosymbolic" architectures augment neural systems (such as foundational models) with factually correct knowledge captured using symbolic AI techniques. In particular, advanced data structures known as ontologies are suitable for capturing verified and factually correct knowledge. Ontologies are typically extracted automatically from massive text collections and undergo a quality check that is often performed with human involvement. The aim of the HOnEst project is to improve the quality of ontologies through scalable human-centric evaluation (HCE), so that they enable factually correct and unbiased (i.e., honest) behaviour of the AI systems that rely on them. To address this goal, the project achieved the following results. First, to fully understand this topic, we systematically surveyed 15 years of work on human-centric ontology evaluation. Second, we experimentally explored solutions to ontology evaluation by splitting the evaluation task into multiple smaller tasks that can be solved in parallel by a large and distributed workforce.
We proposed reusable process models and technical architectures that others can employ for their own HCE campaigns. Third, we focused on making HCE scalable by combining human and algorithmic components. To that end, we investigated suitable neurosymbolic architectures that combine ontologies and neural AI systems, leading to the identification of over 40 such architecture types. We also proposed hybrid human-AI workflows in which humans and algorithmic elements (namely, foundational models) team up to verify ontologies. We experimentally studied these workflows for verifying the Computer Science Knowledge Graph (CS-KG), an ontology-based, large-scale resource containing 41 million facts about the Computer Science domain. As such, HOnEst has delivered an essential piece of the overarching puzzle of ensuring powerful and truthful AI systems, thus contributing to the safe and effective uptake of these technologies even in mission-critical domains where factual correctness is key.
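
The hybrid idea described above, in which an algorithmic component decides which statements need human review and redundant human judgements are then aggregated, can be illustrated with a minimal Python sketch. This is not the project's implementation: the names `Triple`, `route`, and `majority_vote`, the confidence scores, and the 0.9 threshold are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Triple:
    """One ontology statement to be verified (hypothetical representation)."""
    subject: str
    predicate: str
    obj: str


def route(triples, model_confidence, threshold=0.9):
    """Split statements into auto-accepted ones and ones sent to human workers.

    model_confidence maps each Triple to a score in [0, 1]; both the scores
    and the threshold are assumptions of this sketch.
    """
    auto, human = [], []
    for triple in triples:
        if model_confidence[triple] >= threshold:
            auto.append(triple)
        else:
            human.append(triple)
    return auto, human


def majority_vote(votes):
    """Aggregate redundant worker votes (True means 'statement is correct').

    Returns True or False for a clear majority, and None for a tie or an
    empty vote list, so that undecided statements can be escalated.
    """
    yes = sum(1 for vote in votes if vote)
    no = len(votes) - yes
    if yes == no:
        return None
    return yes > no
```

In such a setup, each statement routed to humans becomes one small task that several workers can answer in parallel, and only statements without a clear majority would need expert attention.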

Research institution(s)

Wirtschaftsuniversität Wien - 100%

International project participants

Research Output

2 Citations
33 Publications
4 Policies
5 Datasets & models
1 Software
7 Disseminations
22 Scientific Awards
5 Fundings

Publications

Title	Opportunities for Knowledge Graphs in the AI landscape - An application-centric perspective
DOI	10.1016/j.websem.2025.100867
Type	Journal Article
Author	Motta E
Journal	Journal of Web Semantics

Title	Knowledge Engineering in the Age of Neurosymbolic Systems
DOI	10.1177/29498732251320078
Type	Journal Article
Author	Llugiqi M
Journal	Neurosymbolic Artificial Intelligence

Title	Semantic-Based Data Augmentation for Machine Learning Prediction Enhancement
DOI	10.1177/29498732251340160
Type	Journal Article
Author	Ekaputra F
Journal	Neurosymbolic Artificial Intelligence

Title	Knowledge graph validation by integrating LLMs and human-in-the-loop
DOI	10.1016/j.ipm.2025.104145
Type	Journal Article
Author	Dessì D
Journal	Information Processing & Management

Title	Large Language Model usage when learning Ontology Engineering
Type	Other
Author	Tobias Stubreiter
Link	Publication

Title	DEXA: Supporting Non-Expert Annotators with Dynamic Examples from Experts
DOI	10.1145/3397271.3401334
Type	Conference Proceeding Abstract
Author	Sabou M
Pages	2109-2112
Link	Publication

Title	Ontology Corpora for LLM-based Knowledge Engineering Research
Type	Other
Author	Herwanto G.B.
Link	Publication

Title	Benchmarking Ontology Validation Capabilities of LLMs
Type	Other
Author	Herwanto G.B.
Link	Publication

Title	From Experts to LLMs: Evaluating the Quality of Automatically Generated Ontologies
Type	Other
Author	Ekaputra F.J.
Link	Publication

Title	Validating Semantic Artifacts with Large Language Models; In: The Semantic Web: ESWC 2024 Satellite Events - Hersonissos, Crete, Greece, May 26-30, 2024, Proceedings, Part I
DOI	10.1007/978-3-031-78952-6_9
Type	Book Chapter
Publisher	Springer Nature Switzerland

Title	Enhancing Human-in-the-Loop Ontology Curation Results through Task Design
DOI	10.1145/3626960
Type	Journal Article
Author	Sabou M
Journal	Journal of Data and Information Quality

Title	Enhancing Machine Learning Predictions Through Knowledge Graph Embeddings; In: Neural-Symbolic Learning and Reasoning - 18th International Conference, NeSy 2024, Barcelona, Spain, September 9-12, 2024, Proceedings, Part I
DOI	10.1007/978-3-031-71167-1_15
Type	Book Chapter
Publisher	Springer Nature Switzerland

Title	A process and tool support for human-centred ontology verification
Type	Other
Author	Klemens Käsznar
Link	Publication

Title	Hybrid Human-Machine Evaluation of Knowledge Graphs
Type	Other
Author	Marta Sabou
Conference	Workshop on Human-Centered Design of Symbiotic Hybrid Intelligence, collocated with the first Int. Conf. on Hybrid Human Artificial Intelligence (HHAI)

Title	Verifying Extended Entity Relationship Diagrams with Open Tasks
DOI	10.1609/hcomp.v8i1.7471
Type	Journal Article
Author	Käsznar K
Journal	Proceedings of the AAAI Conference on Human Computation and Crowdsourcing
Link	Publication

Title	Effective Crowd-Annotation of Participants, Interventions, and Outcomes in the Text of Clinical Trial Reports
DOI	10.18653/v1/2020.findings-emnlp.274
Type	Conference Proceeding Abstract
Author	Sabou M
Pages	3064-3074

Title	Effective crowd-annotation of participants, interventions, and outcomes in the text of clinical trial reports
Type	Other
Author	Sabou M.
Pages	3064-3074
Link	Publication

Title	Empirical Software Engineering Experimentation with Human Computation; In: Contemporary Empirical Methods in Software Engineering
DOI	10.1007/978-3-030-32489-6_7
Type	Book Chapter
Publisher	Springer International Publishing

Title	Human Computation Approach for Ontology Restrictions Verification
Type	Conference Proceeding Abstract
Author	Marta Sabou
Conference	AAAI Conference on Human Computation and Crowdsourcing (HComp)
Link	Publication

Title	Human-Centric Ontology Evaluation: A Human Computation approach for ontology restrictions verification
Type	Other
Author	Stefani Tsaneva
Link	Publication

Title	Hybrid Human-Machine Ontology Verification
Type	Other
Author	Alexander Prock
Link	Publication

Title	Enhancing Scientific Knowledge Graph Generation Pipelines with LLMs and Human-in-the-Loop
Type	Other
Author	Dessì D.
Link	Publication

Title	Characteristics of semi-automatic knowledge graph evaluation approaches
Type	Other
Author	Oliver Liu
Link	Publication

Title	Semi-automatic Knowledge Graph Creation and Evaluation
Type	Other
Author	Veronika Hiesinger
Link	Publication

Title	LLM-driven Ontology Evaluation: Verifying Ontology Restrictions with ChatGPT
Type	Other
Author	Tsaneva S.
Pages	15-
Link	Publication

Title	Deriving semantic validation rules from industrial standards: An OPC UA study
DOI	10.3233/sw-233342
Type	Journal Article
Author	Bareedu Y
Journal	Semantic Web
Link	Publication

Title	Combining Machine Learning and Semantic Web: A Systematic Mapping Study
DOI	10.1145/3586163
Type	Journal Article
Author	Breit A
Journal	ACM Computing Surveys

Title	ChatGPT vs Human-in-the-loop: An approach towards automated verification of ontology restrictions
Type	Other
Author	Stefan Vasic
Link	Publication

Title	Benchmarking human-centric ontology defects
Type	Other
Author	Marlene Forman
Link	Publication

Title	Using Human Computation for Ontology evaluation
Type	Other
Author	Orsa Miruna
Link	Publication

Title	Knowledge-centric Prompt Composition for Knowledge Base Construction from Pre-trained Language Models
Type	Other
Author	A. Hughes
Conference	KBC-LM'23: Knowledge Base Construction from Pre-trained Language Models workshop at ISWC
Link	Publication

Title	Describing and Organizing Semantic Web and Machine Learning Systems in the SWeMLS-KG; In: The Semantic Web - 20th International Conference, ESWC 2023, Hersonissos, Crete, Greece, May 28-June 1, 2023, Proceedings
DOI	10.1007/978-3-031-33455-9_22
Type	Book Chapter
Publisher	Springer Nature Switzerland
Link	Publication

Title	Human-Centric Ontology Evaluation: Process and Tool Support
DOI	10.1007/978-3-031-17105-5_14
Type	Book Chapter
Author	Tsaneva S
Publisher	Springer Nature
Pages	182-197

Policies

Title	Consultation on Austrian research funding policy
Type	Participation in a guidance/advisory committee

Title	Horizon Scanning Workshop on Human-Like AI Systems (European Innovation Council)
Type	Participation in a guidance/advisory committee

Title	Consultation on Emerging Technology Signals - European Innovation Council
Type	Participation in a guidance/advisory committee

Title	Austrian AI Delegation to Canada
Type	Participation in a guidance/advisory committee

Datasets & models

Public Access
Title	Pizza Ontology Axioms Dataset
Type	Database/Collection of data
Link	Link

Public Access
Title	Knowledge Graph Triple Validation by LLMs and Human-in-the-Loop
Type	Database/Collection of data
Link	Link

Public Access
Title	Human-centric Evaluation of Semantic Resources: Systematic Mapping Study Protocol and Data
Type	Database/Collection of data
Link	Link

Public Access
Title	HERO (Human-centric ontology Evaluation pROcess) - Process model
DOI	10.5281/zenodo.7643356
Type	Computer model/algorithm
Link	Link

Public Access
Title	Ontology Corpora for LLM-based Knowledge Engineering Research
Type	Database/Collection of data
Link	Link

Software

Title	HERO (Human-centric ontology Evaluation pROcess) - Core Architecture and Plugins
Link	Link

Disseminations

Title	Panel "Women in Tech Summit" Warsaw
Type	A talk or presentation
Link	Link

Title	Dagstuhl Seminar "Knowledge Graphs and Their Role in the Knowledge Engineering of the 21st Century"
Type	A formal working group, expert panel or dialogue
Link	Link

Title	Panel on AI Ecosystems
Type	A formal working group, expert panel or dialogue

Title	Project website
Type	Engagement focused website, blog or social media channel
Link	Link

Title	Panel on "The AI Revolution"
Type	A formal working group, expert panel or dialogue
Link	Link

Title	Inaugural Lecture: "Humans - The Future of Artificial Intelligence?"
Type	A talk or presentation
Link	Link

Title	WU Magazin
Type	A magazine, newsletter or online publication
Link	Link

Scientific Awards

Title	Keynote Speaker at the IEEE International Conference on Data and Software Engineering (ICoDSE)
Type	Personally asked as a key note speaker to a conference
Level of Recognition	Continental/International

Title	Research visit Nilay Tüfek Özkaya
Type	Attracted visiting staff or user to your research group
Level of Recognition	National (any country)

Title	Research visit Lisa Ehrlinger
Type	Attracted visiting staff or user to your research group
Level of Recognition	National (any country)

Title	Research visit Gianluca Demartini
Type	Attracted visiting staff or user to your research group
Level of Recognition	Continental/International

Title	Keynote Speaker at the BILAI Kick-off Event
Type	Personally asked as a key note speaker to a conference
Level of Recognition	National (any country)

Title	Co-Editor of Special Issue on "Knowledge Graphs in the AI landscape - An application-centric perspective"
Type	Appointed as the editor/advisor to a journal or book series
Level of Recognition	Continental/International

Title	Advisory Board Member: INTEND Project (EU)
Type	Prestigious/honorary/advisory position to an external body
Level of Recognition	Continental/International

Title	Keynote Speaker "Workshop on Semantic Technologies and Deep Learning Models for Scientific, Technical and Legal Data"
Type	Personally asked as a key note speaker to a conference
Level of Recognition	Continental/International

Title	Strategic Advisory Board Member: Digital Humanism Association
Type	Prestigious/honorary/advisory position to an external body
Level of Recognition	National (any country)

Title	Invited Talk at University of Mannheim/Engage.EU
Type	Personally asked as a key note speaker to a conference
Level of Recognition	Continental/International

Title	Co-editor of Special Issue on Knowledge Graphs and Neurosymbolic AI
Type	Appointed as the editor/advisor to a journal or book series
Level of Recognition	Continental/International

Title	Research visit Raghava Mutharaju
Type	Attracted visiting staff or user to your research group
Level of Recognition	Continental/International

Title	Advisory Board Member: Josef Ressel Center Industrial Data Lab
Type	Prestigious/honorary/advisory position to an external body
Level of Recognition	National (any country)

Title	Advisory Board Member: Austrian Lab for AI Trust (ALAIT) Project
Type	Prestigious/honorary/advisory position to an external body
Level of Recognition	National (any country)

Title	Board Member: Austrian Society for Artificial Intelligence
Type	Prestigious/honorary/advisory position to an external body
Level of Recognition	National (any country)

Title	Editorial Board Member - Neurosymbolic Artificial Intelligence Journal
Type	Appointed as the editor/advisor to a journal or book series
Level of Recognition	Continental/International

Title	Research visit María Poveda-Villalón
Type	Attracted visiting staff or user to your research group
Level of Recognition	Continental/International

Title	Research visit Paul Groth
Type	Attracted visiting staff or user to your research group
Level of Recognition	Continental/International

Title	Research visit Heiko Paulheim
Type	Attracted visiting staff or user to your research group
Level of Recognition	Continental/International

Title	Research visit Frank van Harmelen
Type	Attracted visiting staff or user to your research group
Level of Recognition	Continental/International

Title	Invited Talk at University of Mannheim
Type	Personally asked as a key note speaker to a conference
Level of Recognition	Continental/International

Title	Invited Speaker at "Austrian Computer Science Days"
Type	Personally asked as a key note speaker to a conference
Level of Recognition	National (any country)

Fundings

Title	PERKS: Eliciting and Exploiting Procedural Knowledge in Industry 5.0
Type	Research grant (including intramural programme)
Start of Funding	2023
Funder	European Commission

Title	ORE: OPC UA Rule Editor
Type	Research grant (including intramural programme)
Start of Funding	2021
Funder	Siemens AG

Title	Visiting Student Grant for Ms. Stefani Tsaneva
Type	Travel/small personal
Start of Funding	2025
Funder	ARC Industrial Transformation Training Centre for Information Resilience (CIRES)

Title	Doctoral College on Digital Humanism
Type	Research grant (including intramural programme)
Start of Funding	2024
Funder	Vienna Science and Technology Fund

Title	WU International Research Fellow (WU IRF) - Ms. Stefani Tsaneva
Type	Travel/small personal
Start of Funding	2025
Funder	Vienna University of Economics and Business


Human-centric Ontology Evaluation (HOnEst)
