Ontology Stratification Methods: a Comparative
Study?
Ricardo Guimarães1 , Uli Sattler2 , and Renata Wassermann1
1
University of São Paulo, São Paulo-SP, Brazil {ricardof, renata}@ime.usp.br
University of Manchester, Manchester, UK
[email protected]
2
Abstract. Comparing the importance of axioms in ontologies is an essential task in a variety of applications such as ontology repair and inconsistency management. It guides the choice of axioms to remain in the
ontology or which should prevail when a conflict arises. While evaluating
the importance of an axiom is difficult, there are different approaches to
stratify ontologies by criteria that act as proxies for importance. However, there is little work about how these methods are related and their
adequacy considering different applications. In this work, we evaluate
specificity and modularity-based stratification approaches, and work towards understanding how they relate. We compare empirically the result
of five stratification methods over a corpus of real-world ontologies. In
particular, we investigate correlations between their rankings and their
ability to distinguish axioms.
Keywords: Stratification · Ontology modularity · Ontology evolution.
1
Introduction
When managing knowledge bases, users and designers are often confronted with
the need to choose among conflicting pieces of information. While the most
reliable solution may be to delegate the choice to a domain specialist with information about the particular case at hand, this is not always possible, and
thus many approaches propose the use of stratification methods. Stratified or
prioritized knowledge bases are such that each axiom has a value that denotes
its importance.
In the absence of further information, an axiom’s importance is closely related to its generality (or specificity), where none of these notions has a commonly accepted definition for complex axioms. Being able to stratify an ontology
by the importance of its axioms, will enable automatic repair algorithms [4] or
inconsistent-tolerant [14] reasoning algorithms to choose to preserve more important information and ignore/drop less important one. a pre-determined criterion.
The idea of stratification or prioritization to solve choice problems in knowledge bases is not new. It has appeared in repair and evolution of databases [9],
?
The first author has been partially supported by CAPES through grant 001 and
partially by FAPESP through grants 2017/04410-0 and 2018/00087-2.
of classical knowledge bases [12] and of ontologies [14, 4]. In Belief Revision,
the area of Knowledge Representation that studies change over logical theories,
Nebel [12] claims that its impossible to completely define a formal revision operation without resorting to some preference ordering. In the context of default
reasoning, Baader and Hollunder [1] discuss and propose approaches to prioritization in terminologies to encode the preference for more specific knowledge
when dealing with inconsistencies.
In this paper we consider OWL 2 DL ontologies [7] and regard them as finite
sets of axioms in SROIQ(D). While some approaches to prioritize axioms in
these ontologies have been proposed [1, 14, 5, 10], to the best of our knowledge,
there is no study that compares stratification techniques. Moreover, there is no
work regarding how to measure the quality of a stratification procedure.
The contributions described in this paper are two-fold: firstly, we devise metrics to measure the ability of these methods to distinguish between the axioms
of an ontology and suitable statistical methods to measure correlation between
these approaches. Secondly, we apply this framework to five different stratification methods and a corpus of existing ontologies, thereby gaining novel insights
into their differences, strengths and weaknesses.
Of the five methods selected for this study, two encode specificity and three
rely on the modular structure of the ontologies. Besides measuring correlation
and axiom dispersion of these methods, we also discuss other aspects such as
restrictions to applicability, feasibility and significance of the rankings produced.
While we do not validate or compare directly proxies for importance, such as,
specificity, generality and module preservation we will often refer to them to
understand the stratification methods considered. It turns out that the methods
perform very differently according to our metrics, and there are some interesting
correlations between some stratification methods.
This paper is accompanied by a technical report with more details about the
implementation of the experiments and additional results [11].
2
Preliminaries
Before detailing the methods we evaluate, we present in this section some fundamental concepts and establish conventions used through this work. We also
assume the reader to be familiar with description logics and its usual notation
(for more details we refer the reader to [2], for example).
We call a set of terms (concept, role and individual names) a signature.
Moreover, if O is an ontology, its signature ( sig(O) ) is the set of terms occurring
in O. We extend the notion and definition for axioms and sets of axioms.
Since we consider ontologies here as sets of formulas (bases) in a description
logic, we follow the approach adopted by Benferhat et al. [4] (but considering
all axioms and not only assertions) and define a stratified ontology O to be a
partitioned set of axioms: O = {Oi }0≤i≤n (with Oi ∩Oj = ∅, if i 6= j). We decide
by convention that the most important axioms are those at the strata with the
lowest index, that is, if i < j, then the axioms in Oi are more important than (or
preferred over) the axioms in Oj . Moreover, if an axiom α is at the stratum Oi ,
we say that it has rank i (rank(α) = i). Through this paper we use the notion
of ranks and partitions interchangeably.
The stratification approaches we discuss in Section 3 are induced by a partial
order of the axioms. Formally, if S is a set and ≺ a partial order on S then, the
stratification of S induced by ≺ is such that, for each a ∈ S:
0,
if a is minimal w.r.t. ≺
rank(a) =
1 + max{b∈O|b≺a} rank(b),
otherwise
2.1
Modularity
In this subsection, we briefly discuss the modularity approach underlying the
stratification methods that we selected for comparison.
The syntactic locality-based modules (LBMs) [6] constitute a modularity
approach which not only has crucial properties for reuse, but also is polynomially extractable. These characteristics, together with its availability through the
OWL API3 explain its current popularity.
Given a signature Σ, the locality-based module of an ontology O for the
signature Σ is a subset of the ontology that has the same entailments as the
ontology over Σ and all other terms that compose the module’s signature. There
are two basic flavours of LBMs: >-locality and ⊥-locality. They differ in the
syntactical rules applied to decide whether an axiom belongs to a module (due
to space constrains, we refer the reader to [15] for the actual definition). For
x ∈ {⊥, >}, we denote the x-locality-based module of the ontology O for the
signature Σ by x-LBM(O, Σ).
The atomic decomposition (or simply AD) of an ontology represents the set
of all locality-based modules of an ontology. The AD of an ontology consists of
a partitioning of the axioms (each partition called atom), according to their cooccurrence in LBMs and a dependency relation between these, that is, a partial
order indicating that whenever an atom is a subset of a module, so are all atoms
that it depends on. As with LBMs, we denote the atomic decomposition of the
ontology O for the locality notion x ∈ {⊥, >} by x-AD(O). Each atom in the
AD, together with the atoms it depends on, forms a genuine module, that is,
a locality-based module that is minimal w.r.t. set inclusion [13]. To clarify the
definition of the atomic decomposition, consider the illustrative ontology KoalaS,
a modified version of the ontology Koala [8], shown in the first column of Table 1
(the whole table will be discussed in Section 3) and its atomic decompositions
in Figures 1a and 1b.
3
Stratification Methods
In this section, we present the stratification methods selected for comparison.
We choose these methods because they only use the input ontology as parameter
and can be computed without human assistance.
3
http://owlcs.github.io/owlapi/
Table 1: The KoalaS ontology and five distinct rankings of its axioms
Axiom
AxS CNS BAH BAD TAH
α1 : Animal v ∃hasGender.Gender
4
1
1
3
2
α2 : Person v Animal
3
1
2
2
1
α3 : Person u Marsupial v ⊥
1
1
3
1
0
α4 : Marsupial v Animal
3
2
2
2
1
α5 : Koala v Marsupial
2
1
3
1
0
α6 : KoalaWithPhD ≡ Koala u ∃hasDegree.{PhD}
1
0
4
0
0
α7 : ∃hasDegree.> v Person
2
1
3
1
0
α8 : Degree(MS )
0
1
0
4
1
α9 : Degree(PhD)
0
1
0
4
1
α6
α7
>
α3
α2
α5
α1
α8
α9
α4
α2
α4
Animal
Marsupial
α1
α3 , α5
α6 , α7
α8
α9
(a) ⊥-AD of KoalaS
(b) >-AD of KoalaS
Koala
Person
∃hasDegree.> Degree Gender
KoalaWithPhD
(c) Inferred concept hierarchy
Fig. 1: Atomic decompositions and inferred concept hierarchy of KoalaS
3.1
Axiom Specificity
Axiom specificity (AxS) was described by Qi and Pan [14], based on the work
of Benferhat, Baida and Cuppens [3] that used this notion of importance in
access-control applications. A similar notion was applied in default logic terminologies [1]. This criterion considers the most specific axioms as the most
important ones. According to the definition in [14] α1 = C1 v D1 is more specific
than α2 = C2 v D2 (written α1 ≺AxS α2 ) in O if O C1 v C2 , but O 6 C2 v C1 .
Therefore, AxS is the stratification of axioms induced by ≺AxS .
We extend the definition of of specificity (≺AxS ) in [14] to other types of
axioms, such as role (chain) inclusions for example, using an analogous methodology for comparison (e.g. check inclusion between the left-hand sides of two
axioms). ABox axioms (concept and role assertions) are minimal axioms, as suggested by Cóbe [5]. That is, given an ontology O, if α is a class or role assertion,
then α is minimal w.r.t. ≺AxS . Excluding relations with ABox axioms, axioms
from different types are incomparable.
Given the differences between SROIQ(D) axioms and their OWL 2 DL
equivalents, in our implementation we handled each axiom type in a straightforward way, more details are included in the technical report [11].
This method is syntax dependent, i.e., two ontologies O1 and O2 can have
equivalent axioms (for every α1 ∈ O1 there exists α2 ∈ O2 , such that α1 ≡ α2 ,
and vice versa), and yet have distinct stratifications. It is possible to change the
rank of an axiom by rewriting it in another way. Consider the axiom A u ∃r.B v X,
a simple absorption, a common optimization in reasoners, would rewrite it equivalently as A v ∀r.¬B t X that is (usually) less specific than its original version.
Moreover, this criterion ignores the right-hand side of axioms, hence in the ontology O = {A v C1 , B v C2 , C1 v C2 }, the axioms A v C1 and B v C2 would be
placed in the same stratum, even though we could say that the first one is more
specific than the because since A and C1 are subconcepts of C2 [5].
The dependency on classification to compute AxS also interferes with their
application on inconsistent ontologies, as the hierarchies collapse. But still, it
can be adapted to such cases [14].
3.2
Concept Name Specificity
We also define an alternative to AxS. First, we establish a partial order of concepts: C is more specific than D (C ≺v D) in O if O C v D, but O
6 D v C. Let
rank(C) be the rank of the concept name C for all concept names in O in the stratification obtained from ≺v , then we say that α1 is more concept-specific than α2
(written α1 ≺CNS α2 ) in O if minC∈ sig(α1 ) rank(C) < minC∈ sig(α2 ) rank(C). The
concept name specificity (CNS) is the stratification of axioms induced by ≺CNS .
Even though this approach does not display the same syntax sensitivity as
axiom specificity for absorption, it fails to capture differences between any pair of
axioms whose most specific concepts are at the same level of specificity. Moreover,
it also has the same issues when applied to inconsistent ontologies as AxS. This
dependency on the hierarchy of concept names also implies that the axioms
without concepts (such as role inclusions) cannot be ranked by this method.
One notable way in which it differs from AxS is the handling of the assertions:
instead of considering all of them equally important, it also considers concepts
involved. For instance if we have O = {A v B, A(a), B(b)}, the axioms A(a) and
B(b) would attain the same rank by axiom specificity, however CNS would tell
us that A(a) is more specific than B(b).
3.3
x-AD Height
The last stratification method we consider, proposed by Guimarães and Wassermann [10], is based on the atomic decomposition (discussed in Subsection 2.1).
We use the partial order determined by the dependency relation in the x-AD of
O: α1 is x-preferred to α2 (written α1 ≺x-AD α2 ) in O if a1 ≺ a2 in x-AD(O),
where ai is the atom that contains the axiom αi (since each axiom belong to
exactly one atom, this relation is well-defined). The x-AD height is the stratification of axioms induced by ≺x-AD . To shorten the notation, we will refer to
⊥-AD height as BAH and >-AD height as TAH.
In [10], the authors focus on >⊥∗ -locality (other type of locality) to define
height, since it produces smaller modules than > or ⊥. However, we restrict
ourselves to > and ⊥-locality, since we are more interested in smaller atoms and
good dependency relations than in small modules. Compared to >⊥∗ -locality,
> and ⊥-locality preserve more information about class hierarchies that seem
to be beneficial for specificity/generality and importance analysis [8] (e.g. the
>⊥∗ -AD of a pure taxonomy has only unconnected atoms). In particular, for
⊥-locality we have that the module preserves all superclass relationships for the
concepts in its signature. In this way, axioms from lower atoms in the ⊥-AD tend
to deal with more general concepts, while higher deal with more specific ones,
and the opposite happens with >-AD. Hence, we expect that BAH displays a
strong negative correlation with specificity-based metrics, while TAH displays a
positive one. To get a clearer picture, we include also the ⊥-AD depth (BAD) in
our investigation; it is defined analogously to ⊥-AD height, but replacing ≺⊥-AD
with ⊥-AD in the definition of rank.
The height is a way to estimate which axioms participate of more or fewer
modules. An axiom with low height is more likely to be part of more genuine modules and thus also of more general modules, via the dependency relations [10].
Proposition 1 considers the case of removal of an axiom and the question whether
modules can be preserved and shows that this metric acts as proxy for importance, if one intends to preserve as many modules as possible.
Proposition 1. Let O be an ontology, x ∈ {⊥, >} a locality notion, α ∈ O
an axiom and Σ ⊆ sig(O) a signature such that α 6∈ x-LBM(O, Σ). Then,
x-LBM(O \ {α}, Σ) = x-LBM(O, Σ).
This proposition is an immediate consequence of the fact that x-locality w.r.t.
Σ (and any of its extensions) during the computation of x-LBMs, is decided on
a “per axiom” basis. As a consequence of Proposition 1, if α ≺x-AD β then the
removal of α affects more x-LBMs than the removal of β. Therefore, axioms with
lower heights can be said to be more influential, thus more important.
As the locality-based modules and atomic decomposition are syntax-dependent,
so are the x-AD heights and depths of the axioms. Therefore, all approaches we
consider are syntax-dependent: AxS by ignoring the right-hand sides, CNS by
considering only at the most specific concept in the class signature and the ones
based on the atomic decomposition due to the definition of syntactic locality.
To conclude this section, we compare the selected stratification methods illustratively with Example 1, which uses the ontology KoalaS from Table 1 and
that is inspired by the ontology repair scenario.
Example 1. Consider the ontology KoalaS shown in Table 1 and suppose that
we would like to restore coherence (the concept KoalaWithPhD is unsatisfiable)
by removing a minimal set of axioms. Therefore, the candidates for removal
are: α3 , α5 , α6 and α7 . Removing any of them is enough to make the ontology
coherent. TAH is unable to distinguish between them. Instead, if we follow axiom
specificity, we would either remove α5 or α7 , using CNS or BAD we would remove
any of them, except α6 , as KoalaWithPhD becomes the most specific concept due
to its unsatisfiability. In contrast, since BAH is more likely to favour more general
axioms (see Figure 1), it would tell us to remove (or fix) α6 .
In Table 1 and Figure 1 we also observe that BAH and AxS are mostly
inverses of each other as we expected, if we ignore the concept assertions α8 and
α9 . Also, BAD is very similar to AxS and CNS. In Section 5, we will see whether
these correlations hold in general.
4
Methodology
Our first methodological step is to formulate the research questions we aim
to answer. The first question is due to the fact that all stratification methods
produce partial orders and thus, do not guarantee that all the axioms have
different ranks. It consists in determining how well these methods separate the
axioms, i.e., how likely a stratification approach is to help us pick a suitable
axiom for a certain task. The second, is to verify if these methods are correlated.
Regarding dispersion, for each method, we are interested in measuring the
number of strata in the whole corpus, their relative size and how many pairs
of equivalent axioms exist after stratification. The number of strata and their
distribution by relative size (the percentage of axioms from the original ontology
inside a stratum) show how often small and large strata occur for each method.
The indistinguishable axioms rate (IAR) of a stratified ontology is the proportion
of pairs of axioms in the ontology that have axioms with the same rank, i.e., it
measures how far the ranking is from a strict total order (the 0 IAR situation).
In the case of the ontology KoalaS from Table 1, we have for ⊥-AD height:
5 strata, with the smaller strata with approximately 11.11% of the axioms and
the largest with around 33.33% of the axioms, and an IAR of roughly 0.1389.
For the second question, we test correlation between methods in two ways.
First, we look at each ontology and measure the correlation between the rankings in each. Second, we normalize the ranks of the axioms in each ontology to
the [0, 1] interval, join the results for the whole corpus and only then measure
correlation. We call them local correlation and corpus correlation, respectively.
We choose Spearman’s rank-order coefficient to measure correlation. This is
a metric of monotonic association between two rankings that does not make
assumptions about the distribution of the input data (i.e. is non-parametric).
Its value ranges from −1 to 1 and expresses whether the association is direct or
inverse. Values close 0 indicate weak or absent correlation. We will also refer to
the p-values associated to the correlation measurements. The p-value indicates
the probability of obtaining the same coefficient given that there is no monotonic
association between the rankings, the null hypothesis for the Spearman’s test.
Smaller p-values mean that we have more grounds to reject the null hypothesis,
thus the correlation is more statistically significant.
4.1
Experiment Design
We want to answer our two research questions by looking at the stratification over
a corpus of realistic ontologies. Moreover, as we rely on reasoning to compute
AxS and CNS, and on current module extraction algorithms, we only handle
legal OWL 2 DL ontologies.
Among the possible corpus options, the set of ontologies hosted by Bioportal4 stands out. This repository hosts ontologies from the biomedical domain,
many of them used in practical applications. The corpus of ontologies used in
this work is a subset of 272 ontologies out of the 438 from the Bioportal snapshot at 30/03/20175 . For brevity, we omit here other details about the corpus
and experiments which are available with additional plots, tables and discussion
regarding influence of expressive power in the technical report [11].
5
Results
5.1
Axiom Dispersion
We start our analysis of dispersion by looking at the cumulative histogram in
Figure 2. For each method there is a line that can be read as the proportion of
strata it produced that have size up to a certain fraction of the original ontology.
Each bin corresponds to increments of 10% of relative size. In addition, since
each method produces a different number of strata, the total number of strata
in the corpus is shown as the N near the method label in the legend.
Cumulative Density
1.0
AxS
(N =
CNS
(N =
BAH
(N =
BAD
(N =
TAH
(N =
0.8
0.6
0.4
0.2
2367)
2087)
3838)
3830)
1512)
0.0
0.0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
Relative Stratum Size
Fig. 2: Cumulative density histograms of stratum relative sizes over the corpus.
For all methods, at least 60% of strata produced has size up to 10% of their
original ontologies. This is a good indicator that they produce useful rankings.
We notice that BAH and BAD are clear winners regarding the total number
4
5
https://bioportal.bioontology.org/
http://doi.org/10.5281/zenodo.439510
of strata generated6 , while TAH is notably the worst method in this aspect.
Furthermore, BAH and BAD have the largest proportions of strata below the
20% mark and are the ones which gain the least as the relative size increases:
the proportion of their strata with size 80% or more of the original ontology is
negligible. In contrast, TAH has the largest proportions of bigger strata, almost
10% of the strata produced by this method has relative size of 90% or more.
The specificity-based approaches not only generated similar number of strata,
but also resemble each other in the distribution of strata by relative size, as we
can observe by the closeness between their histograms’ curves and their trends.
In the technical report [11], we repeated our studies regarding axiom dispersion and correlations diving the ontologies according to expressive power. The
major difference from what we see in Figure 2 is that BAH and BAD perform extremely well in “rich” ontologies, i.e., on the ontologies that cannot be expressed
in RDFS (RDFS only allows concept inclusions, domain and range restrictions
on roles, and assertions). In these “rich” ontologies, BAH and BAD have largest
proportion of strata with relative size of 20% or less, and practically no strata
with relative size of 80% or more. The reasons for this behaviour, as well as
deeper analysis of expressive power in other metrics are part of our future work.
As discussed in Section 4, the IAR is one of the most important characteristics
that we evaluate in this work, as it gives us a clear indication of how likely it is
that each method improves our ability to choose between two axioms. Figure 3
displays one violin plot for each method, their external shape (the “violin”) is
the estimated probability density function (using kernel density estimation), cut
at the range of the observations. The width of the violin indicates the probability
of an ontology having the given IAR value, considering the sample distribution.
The internal elements are boxplots marking from the centre to the extremities
median, interquartile range and 95% confidence interval.
1.0
IAR
0.8
0.6
0.4
0.2
0.0
AxS
CNS
BAH
BAD
TAH
Fig. 3: Distribution of the indistinguishable axioms rates.
6
The small different occurs because in atomic decompositions where the distance
between the nodes at the top and the bottom is not unique (it depends on the pair
of nodes), depth and height produce different number of strata.
From Figure 3, we can infer that the ⊥-AD height is the best approach
regarding IAR, with more than a quarter of the 272 ontologies having IAR of
less than 0.2. Additionally, the IAR of BAH in at least 75% of the corpus is below
0.5 and the probability density decreases rapidly at higher IAR values. While
⊥-AD depth also has 0 IAR in some cases, its distribution is centred on the
median, which is close to 0.5 IAR, and it has also attained 1 IAR in the corpus.
Excluding BAH, all methods reach 0.4 IAR in at least 50% of the ontologies.
As expected, AxS and CNS have similar IAR distributions. Also, we observe
a small dominance of AxS over CNS, as the median and quartiles are marked
at lower IARs. Still, given the proximity of both distributions and the higher
variability of AxS when compared to CNS, we are not able to decide which one
is better than the other from IAR alone. Neither specificity-based approaches
nor >-AD height were able to produce any ranking with IAR smaller than 0.1,
this is shown by the bottom limits of their (cut) violin plots.
5.2
Correlation
We start our correlation analysis with the local version (as defined in Section 4).
Figure 4 shows violin plots (similar to ones presented in Figure 3). Each violin
plot depicts the distribution of the Spearman’s correlation coefficient between
two metrics for each ontology in the corpus. To avoid counting results with low
significance, we consider only instances of correlation for which the p-value was
below 0.05, i.e., we only count situations whose probability of obtaining the same
Spearman’s coefficient under the null hypothesis was less than 5%. The value of
N in the legend indicates, for each method, how many rankings out of the 272
for passed this criterion, for each pair of methods.
Spearman’s Correlation
1.0
0.5
0.0
-0.5
CN
N Sx
= T
18 AH
7
BA
N Dx
= T
19 AH
6
BA
N Dx
= C
23 NS
9
BA
N Hx
= T
21 AH
1
BA
N Hx
= C
22 NS
4
BA
N Hx
= B
25 AD
2
Ax
N Sx
= T
20 AH
6
Ax
N Sx
= CN
24 S
2
Ax
N Sx
= B
24 AD
6
Ax
N Sx
= B
20 AH
7
-1.0
Fig. 4: Distribution of correlation between the methods (for p-value < 0.05).
As anticipated, CNS often has a very strong positive correlation with AxS and
BAD, while AxS and BAD usually display a weaker, but considerable positive
correlation. Generally, TAH correlates positively with CNS, AxS and BAD, but
weakly. The correlations involving BAH are typically negative and weak, even
though we would expect it to display stronger negative associations with BAD
and TAH. We are confident with these results since in all cases most ontologies
displayed a significant correlation (p-value < 0.05).
Table 2: Corpus correlation values between stratification methods.
AxS AxS AxS AxS BAH BAH BAH BAD BAD CNS
×
×
×
×
×
×
×
×
×
×
BAH BAD CNS TAH BAD CNS TAH CNS TAH TAH
Spearman’s -0.1847 0.3108 0.6110 -0.1080 -0.3372 -0.1646 -0.0120 0.4383 0.0633 0.0532
Methods
Now we direct our attention to the corpus correlation (Table 2). We remind
that we normalized each axiom rank to the interval [0, 1], took the union of
all rankings and only then calculated the associations. For brevity, we omit the
p-values because they were sufficiently small (the largest at the order of 10−16 ).
We can observe in Table 2 some similar relationships to those seen in Figure 4.
For example, the strong positive correlation between AxS and CNS and the
average negative one between BAH and BAD. Also, there is a noticeable positive
association between BAD and both AxS and CNS. The other interactions have
values of correlation too close to zero to indicate any consistent association.
All results in this section reflect the behaviour of the methods in a particular
corpus. It may be the case that the overall results differ if we consider ontologies
from a different domain, or include more ontologies from the same corpus.
6
Conclusion and Future Work
In this work we discussed desirable traits for stratification procedures and evaluated five distinct methods according to these criteria. We focused on axiom
dispersion and correlation, devised suitable metrics to evaluate the methods and
discussed the results obtained. The methodology we followed gives insight into
a crucial component for automatic debug and repair of ontologies.
We observed that AxS and CNS displayed similar behaviour and significant
positive association following our expectations. However, the same cannot be
said about BAH and TAH. Besides the low correlation, their behaviour is very
different regarding axiom dispersion: ⊥-AD height being one of the most successful according to our metrics, and >-AD height the worst in number of strata
and relative size distribution. Interestingly, ⊥-AD depth had similar behaviour
regarding dispersion as ⊥-AD height, but displayed a significant correlation with
the specificity-based methods.
As future work we propose the extension of this study covering other stratification approaches, including other metrics related to the atomic decomposition
and other definitions of specificity. Moreover, other characteristics of the stratification methods such as ease of interpretation and computational performance
also need to be studied in detail. Such extended experiments will also include
a larger and more diversified corpus and analyse whether the expressive power
(i.e. the OWL 2 DL fragment considered) affects performance and correlation.
Furthermore, the ability to distinguish axioms, feasibility and ideal of importance can vary according to the application, thus evaluating the effects of a range
of stratification approaches in a particular setting, such as ontology repair, is another possible direction. In this sense, some applications may also provide a way
to compare stratification methods which depend on other parameters besides
the ontology itself (such as syntactic connectivity [5, 10]).
References
1. Baader, F., Hollunder, B.: Priorities on defaults with prerequisites, and their application in treating specificity in terminological default logic. J. of Automated
Reasoning 15(1), 41–68 (1995)
2. Baader, F., Horrocks, I., Lutz, C., Sattler, U.: An Introduction to Description
Logic. Cambridge University Press (2017)
3. Benferhat, S., Baida, R.E., Cuppens, F.: A stratification-based approach for handling conflicts in access control. In: 8th ACM Symposium on Access Control Models
and Technologies. pp. 189–195 (2003)
4. Benferhat, S., Bouraoui, Z., Papini, O., Würbel, E.: Prioritized assertional-based
removed sets revision of DL-Lite belief bases. Annals of Mathematics and Artificial
Intelligence 79(1-3), 45–75 (2017)
5. Cóbe, R.: Integração entre múltiplas ontologias: reúso e gerência de conflitos. Ph.D.
thesis, Universidade de São Paulo (2014)
6. Cuenca Grau, B., Horrocks, I., Kazakov, Y., Sattler, U.: Modular Reuse of Ontologies: Theory and Practice. J. Artificial Intelligence Research 31, 273–318 (2008)
7. Cuenca Grau, B., Horrocks, I., Motik, B., Parsia, B., Patel-Schneider, P.F., Sattler,
U.: OWL 2: The next step for OWL. J. of Web Semantics 6(4), 309–322 (2008)
8. Del Vescovo, C.: The Modular Structure of an Ontology: Atomic Decomposition
and its Applications. Ph.D. thesis, University of Manchester (2013)
9. Fagin, R., Ullman, J.D., Vardi, M.Y.: On the Semantics of Updates in Databases.
In: ACM Symposium on Principles of Database Systems. pp. 352–365 (1983)
10. Guimarães, R., Wassermann, R.: Local Change in Ontologies with Atomic Decomposition. In: 3rd Joint Ontology Workshops (JOWO). CEUR, vol. 2050 (2017)
11. Guimarães, R., Sattler, U., Wassermann, R.: Ontology stratification: a comparative
study. Tech. rep., University of São Paulo, University of Manchester (2018), https:
//www.ime.usp.br/~ricardof/womocoe2018_tr.pdf
12. Nebel, B.: Syntax-Based Approaches to Belief Revision. In: Belief Revision. Cambridge Tracts in Theoretical Computer Science, Cambridge University Press (1992)
13. Parsia, B., Schneider, T.: The Modular Structure of an Ontology: An Empirical
Study. In: 12th International Conference on Principles of Knowledge Representation and Reasoning (KR). AAAI Press (2010)
14. Qi, G., Pan, J.Z.: A stratification-based approach for inconsistency handling in description logics. In: International Workshop on Ontology Dynamics-IWOD (2007)
15. Sattler, U., Schneider, T., Zakharyaschev, M.: Which Kind of Module Should I
Extract? In: Proceedings of the 22nd International Workshop on Description Logics
(DL 2009), Oxford, UK, July 27-30, 2009. CEUR Workshop Proceedings, vol. 477.
CEUR-WS.org (2009)