CMOMgen: Complex Multi-Ontology Alignment via Pattern-Guided In-Context Learning

AI in healthcare
Published: arXiv: 2510.21656v1
Authors

Marta Contreiras Silva Daniel Faria Catia Pesquita

Abstract

Constructing comprehensive knowledge graphs requires the use of multiple ontologies in order to fully contextualize data into a domain. Ontology matching finds equivalences between concepts interconnecting ontologies and creating a cohesive semantic layer. While the simple pairwise state of the art is well established, simple equivalence mappings cannot provide full semantic integration of related but disjoint ontologies. Complex multi-ontology matching (CMOM) aligns one source entity to composite logical expressions of multiple target entities, establishing more nuanced equivalences and provenance along the ontological hierarchy. We present CMOMgen, the first end-to-end CMOM strategy that generates complete and semantically sound mappings, without establishing any restrictions on the number of target ontologies or entities. Retrieval-Augmented Generation selects relevant classes to compose the mapping and filters matching reference mappings to serve as examples, enhancing In-Context Learning. The strategy was evaluated in three biomedical tasks with partial reference alignments. CMOMgen outperforms baselines in class selection, demonstrating the impact of having a dedicated strategy. Our strategy also achieves a minimum of 63% in F1-score, outperforming all baselines and ablated versions in two out of three tasks and placing second in the third. Furthermore, a manual evaluation of non-reference mappings showed that 46% of the mappings achieve the maximum score, further substantiating its ability to construct semantically sound mappings.

Paper Summary

Problem
The main problem this paper addresses is the challenge of integrating multiple ontologies into a single cohesive semantic layer to capture nuanced relationships across domains. This is essential for constructing comprehensive knowledge graphs that provide full coverage for all necessary domains.
Key Innovation
The key innovation of this paper is the introduction of CMOMgen, the first end-to-end Complex Multi-Ontology Matching (CMOM) strategy that generates complete and semantically sound mappings without any restrictions on the number of target ontologies or entities. CMOMgen uses a retrieval-augmented generation framework to select relevant classes and reference examples, enhancing in-context learning.
Practical Impact
This research has significant practical impact in the real world, particularly in the field of knowledge graph construction and ontology alignment. CMOMgen can be applied to various domains, such as biomedical research, where integrating multiple ontologies can help create more comprehensive and accurate knowledge graphs. This can lead to improved information retrieval, knowledge discovery, and decision-making.
Analogy / Intuitive Explanation
Imagine you're trying to connect different libraries in a city to create a single, comprehensive catalog of books. Each library has its own catalog system, and some books might have the same title but different authors or editions. Ontology matching is like finding the equivalent books across different libraries. CMOMgen is like a super-smart librarian who can not only find the equivalent books but also understand the relationships between different books and authors, creating a more nuanced and accurate catalog.
Paper Information
Categories:
cs.AI
Published Date:

arXiv ID:

2510.21656v1

Quick Actions