What future works have the authors mentioned in the paper "Minimizing conservativity violations in ontology alignments: algorithms and evaluation" ?

In order to mitigate incompleteness, the authors plan to study extensions of their techniques to more expressive logical fragments, while keeping the current scalability properties. Nevertheless the authors plan to explore alternative methods to address the conservativity violations. For example, domain experts could be involved in the assessment of the additional disjointness [ 20, 35 ], and to suggest extensions to the input ontologies [ 31 ] for violations recognised as false positives. The authors consider, however, that the proposed methods have also potential in scenarios others than Optique.

What are the common ontology alignment repair systems?

State-of-the-art ontology alignment repair systems, such as ALCOMO [54], AML [66], ASMOV [32], Lily [85], LogMap [33], and YAM++ [60], typically consider the input ontologies as immutable and their repair techniques focus on the mappings.

What is the impact of alignment repair?

The impact of alignment repair is computed as the percentual of gain (resp. loss for negative values) for each measure computed for a repaired alignment, compared to the same measure computed for the original alignment.

What is the principle of conservativity in ontology alignment?

The conservativity principle in ontology alignment aims at capturing the differences in the ontology classification between the input ontologies and the aligned ontology [36] (i.e., new subsumptions and/or new equivalences among concepts).

What is the simplest way to encode the structural index of ontologies?

Given that queries over the structural relationships of ontologies are heavily employed in their approach, the authors rely on the optimized structural index of LogMap [33, 39], based on the interval labelling schema techniques presented in [1]

What is the definition of equivalence violations?

In addition, the authors also define violations between concepts that may have been already involved in a subsumption relationship (i.e., resulting in an equivalence between them), denoted as equivalence conservativity principle violations, or simply equivalence violations.

How many entities were considered not equivalent?

In a later release of such ontology, 15 entities were merged, while 18 were judged as not equivalent by domain experts (NCI ontology curators).

How is the graph representation of the ontology created?

The graph representationG of the aligned ontology w.r.t.O1,O2 andM, is built by means of createDigraph function (line 1 of Algorithm 1).

What is the sum of the detection and repair time of EqRepair?

The experimental results considering EqRepair algorithm, can be summarized as follows:(i) The sum of the detection and repair time of EqRepair is very low due to the linear cost of the detection technique and the efficient parallelization of the diagnosis computation.(ii)

What is the corrective strategy for the ontology?

The correction strategy aims at adding to the input ontologies a minimal set of axioms, so that the input ontologies (in isolation) can entail the novel axiom (solving, in this way, the violation).

What is the definition of a diagnosis for a graph representation of an ontology?

In Definition 4.4 the authors formalize a diagnosis as the set of arcs of the graph representation of an aligned ontology that, once removed, breaks all the unsafe cycles.

What is the average size of the repairs?

The computed repairs are typically of limited size (less than 10%), but can reach a significant portion of the the original alignment.

What is the simplest way to characterize a restricted version of the conservativity principle?

Starting from the results of Proposition 4.2, the authors can characterize a restricted version of the conservativity principle using graph-theoretical concepts only, applied on the graph representation, without the need to refer to the aligned ontology.

What is the mapping repair algorithm for the extended horn propositional formulas?

The step 8 of Algorithm 2 uses the mapping (incoherence) repair algorithm of LogMap, for the extended Horn propositional formulas Pd1 and Pd2 , and the input mappings M. The mapping repair process exploits the Dowling-Gallier (D&G) algorithm [14, 23] for propositional Horn satisfiability (refer to [73], Section 6.3, for more details) and checks, for every propositionA of a given formula P , the satisfiability of the propositional formula PA = P ∪ {> →

(Open Access) Minimizing conservativity violations in ontology alignments: algorithms and evaluation (2017) | Alessandro Solimando

Q: What have the authors contributed in "Minimizing conservativity violations in ontology alignments: algorithms and evaluation" ?

In this paper, the authors present an approach to detect and minimize the violations of the so-called conservativity principle where novel subsumption entailments between named concepts in one of the input ontologies are considered as unwanted.

City, University of London Institutional Repository

Citation: Solimando, A., Jimenez-Ruiz, E. and Guerrini, G. (2017). Minimizing

conservativity violations in ontology alignments: algorithms and evaluation. Knowledge and

Information Systems, 51(3), pp. 775-819. doi: 10.1007/s10115-016-0983-3

This is the accepted version of the paper.

This version of the publication may differ from the final published

version.

Permanent repository link: https://openaccess.city.ac.uk/id/eprint/22961/

Link to published version: http://dx.doi.org/10.1007/s10115-016-0983-3

University of London available to a wider audience. Copyright and Moral

Rights remain with the author(s) and/or copyright holders. URLs from

City Research Online may be freely distributed and linked to.

Reuse: Copies of full items can be used for personal research or study,

educational, or not-for-profit purposes without prior permission or

charge. Provided that the authors, title and full bibliographic details are

credited, a hyperlink and/or URL is given for the original metadata page

and the content is not changed in any way.

City Research Online: http://openaccess.city.ac.uk/ publications@city.ac.uk

City Research Online

Under consideration for publication in Knowledge and Information Systems

Minimizing Conservativity Violations in

Ontology Alignments: Algorithms and

Evaluation

Alessandro Solimando

, Ernesto Jim

enez-Ruiz

, Giovanna Guerrini

DIBRIS, Informatica, Bioingegneria, Robotica e Ingegneria dei Sistemi; University of Genova; Italy

Department of Computer Science; University of Oxford; United Kingdom

Abstract. In order to enable interoperability between ontology-based systems, ontology match-

ing techniques have been proposed. However, when the generated mappings lead to undesired

logical consequences, their usefulness may be diminished. In this paper, we present an approach

to detect and minimize the violations of the so-called conservativity principle where novel sub-

sumption entailments between named concepts in one of the input ontologies are considered as

unwanted. The practical applicability of the proposed approach is experimentally demonstrated

on the datasets from the Ontology Alignment Evaluation Initiative.

1. Introduction

Ontologies play a key role in the development of the Semantic Web and are being used

in many diverse application domains, ranging from biomedicine to energy industry. An

application domain may have been modeled with different points of view and purposes.

This situation usually leads to the development of different ontologies that intuitively

overlap, but they use different naming and modeling conventions.

The problem of (semi-)automatically computing mappings between independently

developed ontologies is usually referred to as the ontology matching problem. A num-

ber of sophisticated ontology matching systems have been developed in the last years

[16, 71]. Ontology matching systems, however, rely on lexical and structural heuristics

and the integration of the input ontologies and the mappings may lead to many undesired

logical consequences. In [36] three principles were proposed to minimize the number of

potentially unintended consequences, namely: (i) consistency principle, the mappings

should not lead to unsatisﬁable concepts in the integrated ontology, (ii) conservativity

Received xxx

Revised xxx

Accepted xxx

2 A. Solimando et. al

principle, the mappings should not introduce new semantic relationships between con-

cepts from one of the input ontologies, (iii) locality principle, the mappings should link

entities that have similar neighbourhoods.

These alignment principles have been actively investigated in the last years (e.g.,

[32, 33, 36, 54, 56, 57, 66]). Violations to these principles are frequent, even in the

reference mapping sets and the alignments generated by the best performing match-

ers of the Ontology Alignment Evaluation Initiative

(OAEI). Also manually curated

alignments, such as the UMLS Metathesaurus [5] (UMLS),

a comprehensive effort for

integrating biomedical knowledge bases, suffer from these violations [36]. The occur-

rence of these violations may hinder the usefulness of ontology mappings. The practical

effect of these violations is clearly evident when ontology alignments are involved in

complex tasks such as query answering [54, 78]. The undesired logical consequences

caused by violations can either prevent query answering, or cause incorrect results. In

order to reduce existing violations, alignment repair methods typically remove a subset

of the alignment, given that input ontologies are considered as immutable, a common

setting in ontology alignment repair scenarios.

It should be noted, however, the different nature of the alignment principles. Vio-

lations of the consistency principle, unlike violations of the conservativity and locality

principles, always lead to an undesired logical consequence (i.e., unsatisﬁability of a

concept) and they should always be avoided. Conservativity and locality violations may

also lead to undesired logical consequences; however they may also represent false pos-

itives and reveal incompleteness in one of the input ontologies. In Section 8 we discuss

alternative approaches that suggest to ﬁx the input ontologies instead of repairing the

alignment (e.g., [7, 48]).

In this paper we focus on the conservativity violations and we follow a “better safe

than sorry” approach (i.e., we treat violations as undesired consequences led by the

mappings). Conservativity violations are presented in two ﬂavours, namely subsumption

violations and equivalence violations. The (potential) challenging number of conserva-

tivity violations requires to exploit the intrinsic characteristics of these two ﬂavours,

that result in the development of different approaches for their repair. The detection

and correction of subsumption violations relies on the assumption of disjointness [67]

and it is reduced to a consistency principle violation problem; while equivalence vi-

olations are addressed using a combination of graph theory and logic programming.

These two methods are combined into a multi-strategy approach addressing both types

of violations. Our extensive evaluation supports the effectiveness of the individual and

combined approaches in the detection and correction of conservativity violations.

The present paper extends [74, 75] under the following aspects: all the experimental

evaluations provided here cover both reference alignments and alignments computed

by participating systems of the OAEI 2012–2014 campaigns, where previous papers

covered only the reference alignments of the OAEI. Compared to [75], the present article

fully details the proposed method, including a correctness proof of the technique for

adding disjointness clauses to Horn Propositional formulas, on which our technique

heavily relies. Furthermore, [75] only dealt with the subsumption violations ﬂavour,

while in this paper we also cover in detail the equivalence violations ﬂavour. Concerning

[74],

all the technical details and proofs are now provided. In addition, the results of

the evaluation of the two possible variants of our combined repair approach are now

http://oaei.ontologymatching.org/

Alignments from UMLS are extracted according to the method deﬁned in [36].

This paper was presented in a workshop without formal proceedings.

Minimizing Conservativity Violations in Ontology Alignments 3

analyzed, as well as the results for the independent techniques in isolation, that can be

used as baseline results. Finally, an empirical assessment of the impact of our repair

methods on the alignment quality (in terms of precision, recall and f-measure) is now

provided.

The remainder of the paper is organised as follows. Section 2 summarises the basic

concepts and deﬁnitions we will rely on along the paper. In Section 3 we introduce

our motivating scenario. Section 4 formally states the problem of computing repairs

for equivalence violations and presents an algorithm to solve such violations. Section 5

describes the method and algorithm to solve subsumption violations. Section 6 details

additional properties of the proposed methods. In Section 7 we present the conducted

evaluation. A comparison with relevant related work is provided in Section 8. Finally,

Section 9 gives some conclusions and future work lines.

2. Preliminaries

In this section, we provide the necessary deﬁnitions and notions that will be used in

the subsequent sections. Section 2.1 brieﬂy introduces OWL 2 and the main elements

in an ontology. In Section 2.2 we give a formal deﬁnition of ontology mapping and on-

tology alignment (adapted from [17]) with their semantics. In Section 2.3 we precisely

deﬁne the semantic consequences imposed by ontology alignments, and we formalize

the consistency and conservativity principles. Finally, Section 2.4 covers the necessary

preliminaries about graph theory.

2.1. Ontologies and OWL 2

Ontologies play a key role in the development of the Semantic Web and are being used

in many diverse application domains, ranging from biomedicine to energy industry.

The most widely used ontology modelling language is the OWL 2 Web Ontology Lan-

guage [11], which is a World Wide Web Consortium (W3C) recommendation [84].

Description Logics (DL) are the formal underpinning of OWL 2 [3, 30].

An OWL 2 ontology O is equipped with a signature Sig(O), that is a vocabulary

of legal names for the entities appearing in the ontology. Sig(O) is composed by the

disjoint union of four ﬁnite sets: (i) N

, a set of unary symbols called named concepts,

(ii) N

, a set of binary symbols called named object properties, (iii) N

, a set of bi-

nary symbols called data properties, (iv) N

, a set of constant symbols called named

individuals.

OWL 2 ontologies can be seen as a set of axioms that are conformant to the syn-

tactic rules and constraints imposed by their underlying DL language [30], and built

using the elements of the signature. The classiﬁcation of O, denoted as Cl(O), corre-

sponds to the result of the computation, performed using an OWL 2 reasoner, of the full

subsumption/subconcept relation between its named concepts (i.e., elements of N

Classiﬁcation is therefore the subset of the logical closure of an ontology O s.t. each

axiom is of the form A v B, where A, B ∈ N

(O) and O |= A v B.

2.2. Ontology Mappings and Alignments

Ontology Mappings. In Deﬁnition 2.1 we provide the deﬁnition of ontology mapping

(also called match or correspondence).

4 A. Solimando et. al

Deﬁnition 2.1. Consider two input ontologies O

, O

, and their respective signature

Sig(O

) and Sig(O

). A mapping between entities of O

, O

is a 4-tuple he, e

, r, ci

such that e ∈ Sig(O

) and e

∈ Sig(O

), r ∈ {v, w, ≡}

is a semantic relation, and

c is a conﬁdence value. Usually, the real number unit interval (0 . . . 1] is employed for

representing conﬁdence values. Mapping conﬁdence intuitively reﬂects how reliable a

mapping is (i.e., 1 = very reliable, 0 = not reliable).

Ontology Alignment. Deﬁnition 2.2 introduces the notion of alignment.

Deﬁnition 2.2. An alignment M between two ontologies, namely O

, O

, is a set of

mappings between O

and O

The main format to represent mappings have been proposed in the context of the

Alignment API, and it is called RDF Alignment [12]. This format is the standard for

the well-known OAEI campaign. In addition, mappings are also represented as standard

subclass and equivalence DL axioms. When mappings are expressed through OWL 2

axioms, conﬁdence values are represented as OWL 2 axiom annotations [35]. The rep-

resentation through standard OWL 2 axioms enables the reuse of the extensive range of

OWL 2 reasoning infrastructure that is currently available. We adopt this representation,

and in the remainder of the paper we consider alignments as set of OWL 2 axioms.

Deﬁnition 2.3 introduces the notion of aligned ontology, resulting from the integra-

tion of two input ontologies, through an alignment between them.

Deﬁnition 2.3. Let O

, O

be two (input) ontologies, and let M be an alignment be-

tween them. The ontology O

= O

∪ O

∪ M is called the aligned ontology w.r.t.

, O

, and M.

is simply called the aligned ontology when no confusion arises. Note that

we assume that the signature of the aligned ontology is always the union of the signa-

tures of the input ontologies. When the input ontologies are clear from the context we

employ the abbreviated notation O

Given that each mapping is translated into an OWL 2 axiom, the aligned ontology is

again an OWL 2 ontology. Note that alternative formal semantics for ontology mappings

have been proposed in the literature, such as those proposed by Zimmermann et al.

in [87], and the semantics associated to the so-called bridge rules, in the context of

distributed description logics [6, 55].

2.3. Semantics of the Integration and Principles for Ontology Alignments

This section introduces the semantics of the integration, and provides a formal charac-

terization of the consistency and conservativity principles in ontology alignment.

Semantic Consequences of the Integration. The ontology resulting from the integra-

tion of two ontologies O

and O

via an alignment M may entail axioms that do not

follow from O

, O

, or M alone. These new semantic consequences can be captured

by the notion of deductive difference [46, 47].

Intuitively, the deductive difference between O and O

, w.r.t. a signature Σ, is the set

of entailments constructed over Σ that do not hold in O, but do hold in O

. The notion

We exclude disjointness from the semantic relations given that most of the available systems do not compute

this relation. Negative constraints are typically harder to identify and assess than positive ones [20].

Minimizing conservativity violations in ontology alignments: algorithms and evaluation

Figures

Citations

Ontology Based Data Access in Statoil

From Polynomial Procedures to Efficient Reasoning with EL Ontologies

Results of the Ontology Alignment Evaluation Initiative 2019

LogMap family participation in the OAEI 2017

Large-Scale Ontology Matching: State-of-the-Art Analysis

References

Depth-First Search and Linear Graph Algorithms

The Unified Medical Language System (UMLS): integrating biomedical terminology

A theory of diagnosis from first principles

Ontology Matching

Similarity flooding: a versatile graph matching algorithm and its application to schema matching

Related Papers (5)

Logic−based Assessment of the Compatibility of UMLS Ontology Sources

Alignment Incoherence in Ontology Matching

LogMap: logic-based and scalable ontology matching

Ontology Matching: State of the Art and Future Challenges

The AgreementMakerLight Ontology Matching System

Frequently Asked Questions (16)

Q1. What have the authors contributed in "Minimizing conservativity violations in ontology alignments: algorithms and evaluation" ?

Q2. What future works have the authors mentioned in the paper "Minimizing conservativity violations in ontology alignments: algorithms and evaluation" ?

Q3. What are the common ontology alignment repair systems?

Q4. What is the impact of alignment repair?

Q5. What is the principle of conservativity in ontology alignment?

Q6. What is the simplest way to encode the structural index of ontologies?

Q7. What is the definition of equivalence violations?

Q8. What are the three principles proposed to minimize the number of potentially unintended consequences?

Q9. How many entities were considered not equivalent?

Q10. How is the graph representation of the ontology created?

Q11. What is the sum of the detection and repair time of EqRepair?

Q12. What is the corrective strategy for the ontology?

Q13. What is the definition of a diagnosis for a graph representation of an ontology?

Q14. What is the average size of the repairs?

Q15. What is the simplest way to characterize a restricted version of the conservativity principle?

Q16. What is the mapping repair algorithm for the extended horn propositional formulas?