Ontology-mediated Queries for Graph Databases
Disciplines
Computer Sciences (80%); Mathematics (20%)
Keywords
- Description Logics,
- Graph Databases,
- Ontology-based Data Access,
- Database Theory,
- Knowledge Representation and Reasoning
Modern software applications need to manage and access increasingly complex data, which often does not conform to rigid schemas used in traditional databases. Graph databases and Description Logics (DLs) provide complementary tools for managing such loosely-structured information. For example, an ontology written in a DL can be used to capture knowledge about a given application domain in a machine readable format. This knowledge can then be used for answering ontology-mediated queries (OMQs): a user query is evaluated not directly over the available data, but a reasoning engine is employed to incorporate the knowledge in the ontology, possibly inferring facts that are not explicitly present in the data. OMQs and queries over graph databases have many similarities, but they also have major differences that we aim to reconcile in this project. In particular, when answering OMQs one makes the so-called open-world assumption (information is assumed to be incomplete), while the closed-world assumption is made in graph databases (information completeness is assumed). It is acknowledged however that both assumptions are too strong, and that the users should have the ability to specify which part of the data is to be assumed as complete and which as incomplete. Second, OMQs do not support navigational features that are very common in languages for querying graph databases and other representations of loosely- structured data (e.g., the path expressions in the XPath query language for XML documents, or the property paths in the SPARQL query language for RDF data). In this project, we will consider extensions of standard OMQs with features that overcome the above two drawbacks, i.e. we will study OMQs that allow to mix the open-world and the closed-world assumptions, and that simultaneously support navigational features. For such rich OMQs, we will study the following fundamental database-theoretic properties: - The first major goal is to compare the expressive power of the considered OMQs with the expressive power of classic database query languages. To this end, we will mainly develop translations from OMQs into suitable variations of Datalog. This will also enable us to transfer existing results from the setting of Datalog to the setting of OMQs, e.g., to reuse existing efficient Datalog engines for answering OMQs. - The second major goal is to study the computational complexity of reasoning tasks to support the design of OMQs. In particular, we will investigate the query containment and the query emptiness problems for the OMQs introduced in the project. These static analysis problems lie at the core of query design and optimization techniques.
Modern software applications need to manage and access increasingly complex data, which often does not conform to rigid schemas used in traditional databases. Description Logics and Graph Databases provide useful but complementary tools for managing such loosely-structured information. Most of the results of this project were on obtaining new data management and access methods that enjoy the positive features stemming from these two different areas. For instance, Description Logics enable the so-called ontology-mediated queries (OMQs), where domain knowledge expressed as an ontology can be used to enable automated reasoning, which in turn can be used for enhanced access to (possibly incomplete) data. In contrast, Graph Databases use query languages that are usually more expressive than the ones used in Description Logics, usually featuring some form of recursion. In addition to the differences in expressiveness, the two areas make different assumptions about data; inheriting the positive features of the two areas requires reconciliation of those different assumptions. For instance, when answering OMQs one makes the so-called open-world assumption (information is assumed to be incomplete), while the closed-world assumption is made in graph databases (information completeness is assumed). This project provided significant contributions to the logical foundations of managing and querying graph-structured data with the help of Description Logics, thus helping to bridge the gap between Description Logics and Graph Databases. Our main results can be summarized as follows: - We developed a collection of translations from OMQs into suitable variations of Datalog, which is a classic query language in Database Theory. The existence of such rewritings was not known prior to our work, which showed that classical query languages can succinctly express the more recent and seemingly more powerful (ontology-mediated) query languages. - We have introduced new logic-based formalisms for data management (Resilient Logic Programs, Clopen Knowledge Bases, Triguarded Fragment), and we have studied their semantic and computational properties. - We have introduced new ways to query graph-structured data. For instance, we introduced a framework for OMQs that have conditional query answers, which allows for a more powerful and flexible formulation of user queries. Another example is the introduced framework and foundational results for an interactive exploration of data via ontology-mediated queries. - We have studied the theoretical foundations of the W3C SHACL standard, which is a new standard that will help maintain the quality of graph-structured data on the Web. We have proposed a new interpretation (called "semantics") for recursive SHACL integrity constraints, which fills in some important gaps in the original SHACL standard. We believe our work is likely to influence the future revisions of this standard by W3C.
- Wirtschaftsuniversität Wien
- Technische Universität Wien
- Axel Polleres, Wirtschaftsuniversität Wien , associated research partner
Research Output
- 134 Citations
- 45 Publications
- 4 Software
- 3 Disseminations
- 12 Scientific Awards
- 2 Fundings
-
2024
Title Datalog rewritability and data complexity of ALCHOIQ with closed predicates DOI 10.1016/j.artint.2024.104099 Type Journal Article Author Lukumbuzya S Journal Artificial Intelligence -
2022
Title Magic shapes for SHACL validation DOI 10.14778/3547305.3547329 Type Journal Article Author Ahmetaj S Journal Proceedings of the VLDB Endowment Pages 2284-2296 Link Publication -
2020
Title Query Rewriting for Ontology-Mediated Conditional Answers Type Conference Proceeding Abstract Author Magdalena Ortiz Conference AAAI 2020 - 34th Conference on Artificial Intelligence, 32nd Innovative Applications of Artificial Intelligence Conference, 10th Symposium on Educational Advances in Artificial Intelligence (EAAI) -
2020
Title Special Issue on Ontologies and Data Management: Part I Type Journal Article Author Mantas Simkus Journal Künstliche Intelligenz (KI) Pages 287-289 -
2020
Title Special Issue on Ontologies and Data Management: Part II Type Journal Article Author Mantas Simkus Journal Künstliche Intelligenz (KI) Pages 439-441 -
2020
Title Polynomial rewritings from expressive Description Logics with closed predicates to variants of Datalog DOI 10.1016/j.artint.2019.103220 Type Journal Article Author Ahmetaj S Journal Artificial Intelligence Pages 103220 Link Publication -
2019
Title Relaxing and Restraining Queries for OBDA DOI 10.1609/aaai.v33i01.33012654 Type Journal Article Author Andresel M Journal Proceedings of the AAAI Conference on Artificial Intelligence Pages 2654-2661 Link Publication -
2020
Title Pebble-Intervals Automata and FO with Two Orders DOI 10.1007/978-3-030-40608-0_14 Type Book Chapter Author Labai N Publisher Springer Nature Pages 208-221 -
2020
Title Ontology focusing: Knowledge-enriched databases on demand Type Conference Proceeding Abstract Author Gogacz T. Conference 24th European Conference on Artificial Intelligence, ECAI 2020 Pages 745-752 Link Publication -
2018
Title Taming Complex Role Inclusions for DL-Lite Type Conference Proceeding Abstract Author Andresel Medina Conference Proceedings of the 31st International Workshop on Description Logics, DL 2018 Link Publication -
2018
Title The Triguarded Fragment of First-order Logic Type Conference Proceeding Abstract Author Mantas Simkus Conference 22nd International Conference on Logic for Programming, Artificial Intelligence and Reasoning (LPAR) Link Publication -
2018
Title Rewriting Guarded Existential Rules into Small Datalog Programs Type Conference Proceeding Abstract Author Magdalena Ortiz Conference ICDT 2018 - 21st International Conference on Database Theory -
2018
Title Compiling Model Representations for Querying Large ABoxes in Expressive DLs Type Conference Proceeding Abstract Author Labinot Bajraktari Conference IJCAI-ECAI 2018 - 27th International Joint Conference on Artificial Intelligence and the 23rd European Conference on Artificial Intelligence -
2018
Title Relaxing and Restraining Queries for OBDA (Extended Abstract) Type Conference Proceeding Abstract Author Magdalena Ortiz Conference KR 2018 - 16th International Conference of Principles of Knowledge Representation and Reasoning -
2022
Title Reasoning Web. Declarative Artificial Intelligence, 17th International Summer School 2021, Leuven, Belgium, September 8–15, 2021, Tutorial Lectures DOI 10.1007/978-3-030-95481-9 Type Book editors Šimkus M, Varzinczak I Publisher Springer Nature -
2022
Title Repairing SHACL Constraint Violations Using Answer Set Programming Type Conference Proceeding Abstract Author Robert David Conference ISWC 2022 - 21st International Semantic Web Conference Pages 375-391 -
2022
Title Pointwise Circumscription in Description Logics Type Conference Proceeding Abstract Author Federica Di Stefano Conference DL 2022 - 35th International Workshop on Description Logics co-located with FLoC 2022 - Federated Logic Conference -
2022
Title Well-founded Semantics for Recursive SHACL Type Conference Proceeding Abstract Author Adrian Chmurovic Conference Datalog-2.0 2022 - 4th International Workshop on the Resurgence of Datalog in Academia and Industry Pages 2-13 -
2022
Title Well-founded Semantics for Recursive SHACL Type Other Author Adrian Chmurovic Conference Datalog-2.0 2022 - 4th International Workshop on the Resurgence of Datalog in Academia and Industry Pages 2-13 Link Publication -
2018
Title Relaxing and Restraining Queries for OBDA DOI 10.48550/arxiv.1808.02850 Type Preprint Author Andresel M -
2022
Title Highly Customizable Assemblies using Object-based Automated Planning DOI 10.34726/hss.2022.101480 Type Other Author Župa A Link Publication -
2022
Title Repairing SHACL Constraint Violations Using Answer Set Programming DOI 10.1007/978-3-031-19433-7_22 Type Book Chapter Author Ahmetaj S Publisher Springer Nature Pages 375-391 -
2017
Title Managing Change in Graph-Structured Data Using Description Logics DOI 10.1145/3143803 Type Journal Article Author Ahmetaj S Journal ACM Transactions on Computational Logic (TOCL) Pages 1-35 Link Publication -
2021
Title Magic Shapes for Validation in SHACL Type Conference Proceeding Abstract Author Bianca Loehnert Conference ISWC 2021 - 20th International Semantic Web Conference -
2021
Title Bounded Predicates in Description Logics with Counting Type Conference Proceeding Abstract Author Mantas Simkus Conference IJCAI 2021 - 30th International Joint Conference on Artificial Intelligence -
2021
Title Automata-based Reasoning for Decidable Logics with Data Values DOI 10.34726/hss.2021.94060 Type Other Author Labai N Link Publication -
2020
Title Querying Rich Ontologies by Exploiting the Structure of Data DOI 10.1007/s13218-020-00672-9 Type Journal Article Author Bajraktari L Journal KI - Künstliche Intelligenz Pages 395-398 Link Publication -
2020
Title Query Rewriting for Ontology-Mediated Conditional Answers DOI 10.1609/aaai.v34i03.5660 Type Journal Article Author Andresel M Journal Proceedings of the AAAI Conference on Artificial Intelligence Pages 2734-2741 Link Publication -
2020
Title Rewriting Approaches for Ontology-Mediated Query Answering DOI 10.1007/s13218-020-00671-w Type Journal Article Author Ahmetaj S Journal KI - Künstliche Intelligenz Pages 523-526 Link Publication -
2020
Title An ExpTime Upper Bound for ALC with Integers DOI 10.24963/kr.2020/61 Type Conference Proceeding Abstract Author Labai N Pages 614-623 Link Publication -
2020
Title Special Issue on Ontologies and Data Management: Part I DOI 10.1007/s13218-020-00682-7 Type Journal Article Author Schneider T Journal KI - Künstliche Intelligenz Pages 287-289 Link Publication -
2020
Title Ontologies and Data Management: A Brief Survey DOI 10.1007/s13218-020-00686-3 Type Journal Article Author Schneider T Journal KI - Künstliche Intelligenz Pages 329-353 Link Publication -
2020
Title Special Issue on Ontologies and Data Management: Part II DOI 10.1007/s13218-020-00693-4 Type Journal Article Author Schneider T Journal KI - Künstliche Intelligenz Pages 439-441 Link Publication -
2020
Title Answering Regular Path Queries Over SQ Ontologies DOI 10.48550/arxiv.2011.08816 Type Preprint Author Gutiérrez-Basulto V -
2020
Title An ExpTime Upper Bound for $\mathcal{ALC}$ with Integers (Extended Version) DOI 10.48550/arxiv.2006.02078 Type Preprint Author Labai N -
2020
Title Stable Model Semantics for Recursive SHACL DOI 10.1145/3366423.3380229 Type Conference Proceeding Abstract Author Andresel M Pages 1570-1580 -
2019
Title Pebble-Intervals Automata and FO2 with Two Orders (Extended Version) DOI 10.48550/arxiv.1912.00171 Type Preprint Author Labai N -
2019
Title Ontology-Mediated Queries from Examples: a Glimpse at the DL-Lite Case DOI 10.29007/jhtz Type Conference Proceeding Abstract Author Ortiz M Pages 1--14 Link Publication -
2019
Title Polynomial Rewritings from Expressive Description Logics with Closed Predicates to Variants of Datalog DOI 10.48550/arxiv.1912.07475 Type Preprint Author Ahmetaj S -
2018
Title Answering Regular Path Queries over SQ Ontologies DOI 10.1609/aaai.v32i1.11539 Type Journal Article Author Gutiérrez-Basulto V Journal Proceedings of the AAAI Conference on Artificial Intelligence Link Publication -
2018
Title Combining Rules and Ontologies into Clopen Knowledge Bases DOI 10.1609/aaai.v32i1.11565 Type Journal Article Author Bajraktari L Journal Proceedings of the AAAI Conference on Artificial Intelligence Link Publication -
2018
Title Improving Data Management using Domain Knowledge DOI 10.24963/ijcai.2018/814 Type Conference Proceeding Abstract Author Ortiz M Pages 5709-5713 Link Publication -
2019
Title Polynomial Rewritings from Expressive Description Logics with Closed Predicates to Variants of Datalog Type Journal Article Author Magdalena Ortiz Journal Artificial Intelligence -
2019
Title Optimizing Horn-SHIQ Reasoning for OBDA Type Conference Proceeding Abstract Author Labinot Bajraktari Conference ISWC 2019 - 18th International Semantic Web Conference Pages 75-92 -
2019
Title Relaxing and Restraining Queries for OBDA Type Conference Proceeding Abstract Author Medina Andresel Conference AAAI 2019 - 33rd Conference on Artificial Intelligence Pages 2654-2661
-
2021
Link
Title Artificial Intellicence - broadcasted event Type A broadcast e.g. TV/radio/film/podcast (other than news/press) Link Link -
2021
Link
Title Interview for TUW MAGAZINE Type A magazine, newsletter or online publication Link Link -
2021
Link
Title "Tagebuch der Informatikerin" Online - Speed Mentoring Program Type Participation in an activity, workshop or similar Link Link
-
2023
Title Keynote speaker Workshop on Logic, Language, Information and Computation (WoLLIC 2023) Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2022
Title Hedy Lamarr Prize 2022 Type Research prize Level of Recognition Regional (any country) -
2021
Title Editorial board of the Artificial Intelligence Journal (AIJ) Type Appointed as the editor/advisor to a journal or book series Level of Recognition Continental/International -
2021
Title Keynote speaker International Conference on Formal Concept Analysis (ICFCA 2021) Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2021
Title netidee SCIENCE Type Research prize Level of Recognition National (any country) -
2020
Title Frontiers in AI talk, 24th European Conference of Artificial Intelligence (ECAI) Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2020
Title Keynote speaker at International Workshop on Description Logics (DL 2020) Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2019
Title Guest editor for the Special Issue on Ontologies and Data Management, published by Springer in the German Journal of Artificial Intelligence Type Appointed as the editor/advisor to a journal or book series Level of Recognition Continental/International -
2019
Title Keynote speaker at the Alberto Mendelzon Workshop on Foundations of Data Management (AMW 2019) Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International -
2019
Title Best student paper award at DL 2019 Type Research prize Level of Recognition Continental/International -
2019
Title Steering Committee Member at the International Workshop on Description Logics Type Prestigious/honorary/advisory position to an external body Level of Recognition Continental/International -
2018
Title Early-Career Spotlight talk International Joint Conference on Artificial Intelligence (IJCAI) Title: Improving Data Management using Domain Knowledge Type Personally asked as a key note speaker to a conference Level of Recognition Continental/International
-
2020
Title Concept recognition in monitoring data Type Research grant (including intramural programme) Start of Funding 2020 Funder Wirtschaftsagentur Wien -
2021
Title Foundations of Reasoning in the Shape Constraint Language Type Other Start of Funding 2021 Funder Austrian Science Fund (FWF)