Вы находитесь на странице: 1из 8

ConTeXtualized local ontology specification via CTXML

Paolo Bouquet1 Antonia Dona2 Luciano Serafini2 and Stefano Zanobini1


1 Dept. of Computer Information and Communication Technologies, University of Trento, Italy
2 ITC-Irst, Via Sommarive Localita Povo, 38050 Trento, Italy
bouquet@dit.unitn.it, antodona@itc.it,
serafini@itc.it, stefano.zanobini@libero.it

Abstract A common approach to overcome semantic heterogene-


ity is to introduce shared representations of meanings. In-
In many application areas, such as the semantic web, knowl- deed, it is obvious that designing any semantic-based ap-
edge management, distributed databases, its been recognized plication would be much easier if people agreed to orga-
that we need an explicit way to represent meanings. A ma- nize and exchange information in a common language with
jor issue in all these efforts is the problem of semantic inter-
operability, namely the problem of communication between
a common semantic. However, in general this is not the
agents using languages with different semantic. Following case. Despite the effort for defining a standard semantic
(Bonifacio, Bouquet, & Traverso 2002), we claim that a tech- for various domains, people seem to resist to such an at-
nological infrastructure for semantic interoperability between tempt of homogenization. Partly, this is due to practical
semantically autonomous communities must be based on problems (it can be very costly to change the overall or-
the capability of representing local ontologies and mappings ganization of a database, or the classification of large col-
between them, rather than on the attempt of creating a global, lections of documents. But we believe that there are also
supposedly shared, conceptualization. The goal of this pa- theoretical reasons why this homogeneity is not accepted,
per is to define a theoretical framework and a concrete lan- and in the end is not even desirable. In fact, lots of cog-
guage for the specification of local ontologies and mappings nitive and organizational studies show that there is a close
between them.
relationship between knowledge and identity. Knowledge is
not simply a matter of accumulating true sentences about
the world, but is also a matter of interpretation schemas (e.g.
Introduction paradigms (Kuhn 1979), contexts (Benerecetti, Bouquet, &
In many application areas, such as the semantic web, knowl- Ghidini 2000), mental models (Johnson-Laird 1983), per-
edge management, distributed databases, its been recog- spectives (Boland & R.V.Tenkasi 1995), . . . ), which allow
nized that we need an explicit way to represent meanings. people to make sense of what they know. Therefore, any
This is needed to enhance (or enable) services like search, attempt of imposing external interpretation schemas (and a
knowledge and service integration, discovery. This pro- definition of meaning always presupposes some interpreta-
duced efforts like ontology development, semantic-based tion schema, at least implicitly) is perceived as an attack to
markup languages, agent communication languages. an individuals or a communitys identity. Moreover, inter-
A major issue in all these efforts is the problem of se- pretation schemas are an essential part of what people know,
mantic interoperability. Even in restricted domains, like as each of them provides an alternative lens through which
medicine, tourism, banking, automotive, it is very difficult reality can be read. Thus, imposing a single schema is al-
to find an agreement on common vocabularies and shared ways a loss of global knowledge, as we throw away possibly
conceptualizations. This leads to a situation in which differ- innovative perspectives.
ent agents use the same word to mean different things, use If we accept that interpretation schemas are important,
different words to mean the same thing, use different granu- then we need to approach the problem of semantic interop-
larity to describe the same domain, describe a domain from erability from a different perspective. Instead of pushing to-
a different perspective, and so on. All together, this is what wards a greater uniformity, we need a theoretical framework
researchers call semantic heterogeneity, namely a situation in which:
in which agents do not understand each others as they use different conceptualizations (called local ontologies)
languages with heterogeneous semantic. can be autonomously represented and managed (and,
This work has been partially supported by the project therefore, we call them contextualized);
EDAMOK Enabling Distributed and Autonomous Management of people can discover and represent relationships between
Knowledge, funded by the Provincia Autonoma di Trento with de- local ontologies;
liberation number 1060 on date 4/5/2001.
Copyright c 2002, American Association for Artificial Intelli- the relationships between local ontologies can be used to
gence (www.aaai.org). All rights reserved. provide semantic-based services without destroying the
semantic identity of the involved parties. 1995)), and contributes to define a communitys identity.
In general, different perspectives are not completely unre-
We see meaning negotiation as the process that dynami-
lated. For example, two communities may have different
cally enable agents to discover relationships between local
viewpoints on the same domain, or have overlapping inter-
ontologies. The goal of this paper is to create an environ-
ests. Possible existing relations between different perspec-
ment in which the preconditions for meaning negotiation
tives can be seen as mappings between the relative different
are satisfied. In particular, on the one hand, we define a
and autonomous conceptualizations. These mappings can-
theoretical framework in which local ontologies and map-
not be defined beforehand, as they presuppose a complete
pings between them can be represented; on the other hand,
understanding of the two conceptualizations, which in gen-
we provide a language for describing what we call a con-
eral is not the case. This means that mappings can be discov-
text space, namely a collection of contexts and their map-
ered dynamically. e.g. through communication, experience,
pings; this language is called ConTeXt Markup Language
trial and error. In other words, through meaning negotia-
(CTXML) and is based on XML and XML-Schema. Lo-
tion. We are now going to describe the theoretical frame-
cal ontologies are represented as contexts, in the sense dis-
work presented in the introduction with respect to such a
cussed in formal papers like (Ghidini & Giunchiglia 2001;
KM scenario.
Benerecetti, Bouquet, & Ghidini 2000). In what follows, we
will use knowledge management as our main motivation for
contextualized local ontologies; however, as we said at the Contexts
beginning, we believe that similar motivations can be found The abstract representation of a context is a triple
in any semantically distributed application, e.g. the semantic hc, A , R i, where: c is a context identifier; A is a collection
web. of explicit assumptions; and R is an explicit representation.
The context identifier c is a unique identifier associated
Contextualized local ontologies in knowledge with a context; explicit assumptions are attributes (parame-
management ter/value pairs) that provide meta-information about the con-
Knowledge has been recognized as one of the most impor- text (e.g., the context owner, or its history); the explicit rep-
tant assets of modern organizations. (Drucker 1994) claims resentation is the real content of a context, namely a concep-
that we are entering the knowledge era, in which the basic tualization, and is represented as a labelled tree. Possible
economic resource is no longer capital, or natural resources, reference models for the content of contexts can be based
or labour, but knowledge. on first order logics, propositional logics, description log-
As a managerial practice, Knowledge Management (KM) ics, general graph structures (graphs, acyclic graphs, lattices,
can be described as a collection of methodologies and tools etc.), concept hierarchies, and so on. In this paper, we chose
that provide support in creating new knowledge within to use concept hierarchies (in the sense defined in (Buchner
the organization and codifying such newly created knowl- et al. 1999)).
edge into storable objects (e.g. documents, repositories, Concept hierarchies are built from a set L of labels. L is
databases, procedures, forms). composed by two disjoint subsets:
(Bonifacio, Bouquet, & Traverso 2002) shows that in KM 1. LC (concept labels);
we can find the same dichotomy between centralized and 2. LR (relation labels).
distributed approaches to knowledge, and provides motiva-
tions to support a distributed approach to KM, namely an LR is split into hierarchical (LH ) and non-hierarchical or
approach that starts from the recognition that there exist au- general labels (LG ).
tonomous communities within an organization and that tech-
Definition 1 (Concept hierarchy). A concept hierarchy is
nology should supports knowledge exchange not by elim-
a graph H = hC, Ei, where C is a finite set of nodes, E a
inating differences, but by designing systems that will en-
finite set of directed edges between nodes, and all the nodes
able semantic interoperability between autonomous com-
and edges have a label from L, such that the edges labelled
munities.
with hierarchical labels form a tree.
We assume that autonomous communities organize their
(local) knowledge according to a local ontology, i.e. a set We dont put any restriction on the set LC of concept la-
of terms and relations between terms, which provides a con- bels, whereas we require that LR = {is-a, part-of, inst-of},
ceptualization of the piece of world which is relevant for the where: is-a represents the subclass relation (for instance
objectives of that community. Examples of local ontologies Cat is-a Animal, Man is-a Mortal); part-of represents
are: a set of shared directories, a taxonomy used by a scien- the relation of being part of (for instance, Leg part-of
tific community, and so on. Each community (team, group, Man, Tenor part-of Choir); inst-of represents the fact that
and so on) within an organization has its own conceptualiza- a certain individual is an instance of a concept (for instance
tion of the world, which is partial (i.e., covers only a portion Paolo inst-of Man, Michele inst-of Tenor).
of the world), approximate (i.e., has a degree of granular- Contexts can be given a concrete representation using dif-
ity), and perspectival (i.e., reflects the communitys view- ferent languages: XML, XML-schema, KIF, CycL, Ontolin-
point on the world). Each local conceptualization represents gua, DAML-OIL, RDF, RDF-schema (see (Corcho & Perez
a communitys perspective on the world (see for example the ) for a survey on these languages). Among them, we decided
concept of perspective making in (Boland & R.V.Tenkasi to adopt XML and XML-Schema.

2
The concrete representation of a context is an XML doc- Context.xsd

ument composed of two main parts: the header and the con- What
tent. The header is an XML document containing the fol- we define
in this Context23.xml
lowing components: document ...
(an XML schema <Content>
for a generic Instance of
Owner The owner of the context. This is important in KM context)
applications, as it can be used, for example, to grant per- <complextype name = "article">
missions, and to allow users to assign a degree of con- <attribute name = "author type="strung"/>
</complextype>

fidence to a context based on their trust in the context


owner. </Content>
...
Group The group in which this context has been developed.
Instance of
This is necessary as the owner can be a member of differ- Doc23summary.xml

ent groups. <article>


<author> Quantum phisics
Security Contains information about the access rights, paolo rossi Text minimg paolo rossi

blaldkf;j fak sdkaljfalklk


</author>
passwords, encryption, etc. </article>
lkadflakfjd;
lakdjf;lakfa
lkajf;lkajd;fl
;lakjf;lakjdf
;lakdjf;lakjdf
klasd akfha
akfha kahdf aa =
kadh aj f ajf ad
la;jdf
;ladjf

History Contains information on how a context was gener- lakjd;flkajd


akd alfja alkf
l alfja flla fk
lajd ;a;dfkja f

a;lkdfja;ldfkj

ated. Examples are: from scratch, by combining (por-


tions of) existing contexts, by copying some existing
Figure 1: CTXML
context.
The content of a context is a concrete representation of a Entertainment

concept hierarchy in an XML-schema. Since XML-schema SEE_ALSO


isa
is an XML document, an XML-schema document can be in- isa
isa isa
isa isa
cluded in the XML-document that describes a context. The
Shopping Games Music Comics & Movies & Television
elements and the edges of a concept hierarchy are repre- Animation Films
sented via XML-schema constructs as follows: partOf isa Syn: Movies isa isa

1. Each node of a concept hierarchy is represented either as Credit Cards SoundTracks


a complexType or as an element. For example, the con- instanceOf
VHS Actors Education Animation

cept Man and its instance Paolo would be represented instanceOf isa isa
as follows:
Platoon O.S.T.
Stefano Accorsi
<complexType name="Man"/> Cartoons Artists

<element name="Paolo" type="Man"/> isa

2. The relation is-a is represented via the element SEE_ALSO


Warner Bros
extension. Thus, the relation Man is-a Mortal is rep-
instanceOf
resented as follows:
<complextype name="Man"> Looney Tunes

<extension base="Mortal"/>
</complexType> Figure 2: The context structure Entertainment
3. The relation part-of is represented through a combination
of the primitives element and extension. For exam- <element name="NokiaAB"
ple, to declare that Keyboard, Monitor, and Case type="PCMonitor"/>
are elements of a Personal Computer, one can use the
following XML-schema: In order to validate a context, we can write down a DTD,
or another XML-schema, that describes the structure of the
<complextype name="PC"> XML-schema for Concept Hierarchies. For uniformity rea-
<extension base="anything"> sons, we chose to express the structure of a context in an
<element name="keyboard" XML-schema document. An overview of the intuition un-
type="PCKeyboard"/> derlying the concrete representation of the content (i.e. the
<element name="monitor" explicit representation) of a context is depicted in Figure 1.
type="PCMonitor"/> Context.xsd is an XML schema that describes the struc-
<element name="case" ture of a generic context; any context can be specified as
type="PCCase"/> an XML document (e.g., context23.xml) of the type de-
</extension> scribed by context.xsd.
</complexType>
4. The relation inst-of is represented through the primitive An example
element. The example shows the declaration of the fact In this section we present an example of two contexts that
that that NokiaAB inst-of PCMonitor. represent the domain of Entertainment; the conceptual struc-

3
Arts &
tures are portions of the web directory structures of two Entertainment

well-known Internet search engines. The graphical repre- isa


isa
sentation of the two structures is shown in Figure 2 and Fig- isa isa isa

ure 4, respectively. Newspaper Games Magazines Music Movies


Hierarchical arcs of both context structures are not la- partOf RELATED_TO
isa
isa
isa
isa
belled by any relation, as this information is left implicit. partOf isa

These relations can be guessed on the basis of the seman- Console Cartridges SoundTracks DVD On Video In Theatre
IMAX

tics of the labels of the nodes. In the pictures above, instanceOf


Syn: Home Video
Syn: VHS
we deliberately added relation labels as an example. Fur- Syn: Video Tape

thermore, both contexts contain a non-hierarchical relation, PlayStation RELATED_TO

called SEE ALSO.


As we said, a context is represented as an XML docu-
Figure 4: The context structure Arts and Entertainment
ment composed of two parts: header and content (i.e., the
concept hierarchy). The context header of Entertainment
is described below, while the the concept hierarchy is repre-
hi-fi for cars is conceptualized in the same way in the two
sented in Figure 3. For the sake of simplicity, we chose to
contexts. Nonetheless, the concepts in the two contexts are
report in Figure 3 only a subset of concepts and relations of
in some relation. Of course, it is essential that we provide a
the original concept hierarchy.
way of defining such a relation between concepts of the two
<ctxHeader ctxId="cxt01" contexts (for example, to return relevant documents from a
label="Entertainment" context as an answer to a query asked in the other context).
add-manager="antodona" This is achieved by introducing the idea of a context map-
status="draft"> ping. In this section we give a data structure for context
<owner> mapping.
<agentId>Agent02</agentId> The general intuitions behind a context mapping are the
</owner> following:
<group>
<groupId>Entertainment</groupId> a context mapping is directional. Indeed, we want to be
</group> able to represent the situation in which a context c1 im-
<security> ports information from another context c2 using a certain
<accessRights>read</accessRights> context mapping m, without forcing c2 to use the same
<encription>none</encription> (or the inverse) mapping m to import the information for
</security> context c1 .
<history> a context mapping should not be limited to the represen-
<version>v1.0</version> tation of equivalence between concepts of two different
<generatedFrom> contexts. It should allow for the representation of rela-
<operationType>scratch</operationType> tions between concepts at different abstraction level. For
<elementUri/> instance, a context mapping should be able to represent
</generatedFrom> the fact that the concept printer in a context is more gen-
</history> eral than the concept laser printer in some other context.
</ctxHeader>
A mapping is a relation between a context (called source)
In Figure 3, one can easily recognize a piece of XML- and another context (called
Schema with definitions of new complexType and element target), and
it can be formally
represented by a 4-tuple m, cs , M , ct , where m is a unique
as components of a new schema. The last part of the doc- identifier for the mapping; cs and ct are two different context
ument is the reference to existing mappings for the defined identifiers, for the source and the target context respectively;
context: M is the real mapping, i.e. the actual relation between the
<mappingForCtx mapId="map01"/> explicit representations of cs and ct .
The other selected context is the one of the Arts and En- Let us define the abstract structure of mappings between
tertainment. The concept hierarchy is depicted in Figure 4. two context hierarchies.
Definition 2. A context mapping is a 4-tuple m, cs , M , ct ,

For lack of space, we do not provide the XML code of this
context. where:

Mappings 1. m a unique identifier associated with a mapping;


The explicit representations of two contexts can describe a 2. cs and ct are distinct context identifiers, called source and
common portion of the world. For instance, a context about target context;
cars components and a context about radio and hi-fi 3. M is concept mapping from the content of the source con-
might overlap on radio and hi-fi for cars. Despite this fact, text to the content of the target context. Where a context
it is not guaranteed that the common part about radio and mapping from a concept hierarchies Hs = hCs , Es i to Ht =

4
<ctxContent language="en">
<CHContent>
<schema xmlns="http://www.w3.org/2001/XMLSchema"
targetNamespace="http://.../entertainmentYahoo">
<complexType name="Entertainment">
<annotation>
<appinfo><label>Entertainment</label></appinfo>
</annotation>
</complexType>
<complexType name="Entertainment_Shopping">
<annotation>
<appinfo><label>Shopping</label></appinfo>
<documentation> This is the node Shopping which IS_A Entertainment </documentation>
</annotation>
<complexContent>
<extension base="Entertainment">
<element name="CreditCards" type="Entertainment_Shopping_CreditCards"/>
</extension>
</complexContent>
</complexType>
<complexType name="Entertainment_Shopping_CreditCards">
<annotation>
<appinfo><label>CreditCards</label></appinfo>
<documentation> This is the node CreditCards which is
PART_OF the node Shopping </documentation>
</annotation>
</complexType>
<complexType name="Entertainment_Music">
<annotation>
<appinfo><label>Music</label></appinfo>
<documentation> This is the node Music which IS_A Entertainment </documentation>
</annotation>
<complexContent>
<extension base="Entertainment"/>
</complexContent>
</complexType>
<complexType name="Entertainment_Music_SoundTracks">
<annotation>
<appinfo><label>SoundTracks</label></appinfo>
<documentation>
This is the node SoundTracks which IS_A Entertainment_Music
</documentation>
</annotation>
<complexContent>
<extension base="Entertainment_Music"/>
</complexContent>
</complexType>
<element name="PlatoonO.S.T." type="Entertainment_Music_SoundTracks">
<annotation>
<appinfo>
<label>PlatoonO.S.T</label>
</appinfo>
<documentation>
This is the node PlatoonO.S.T which is an
INSTANCE_OF Entertainment_Music_SoundTracks
</documentation>
</annotation>
</element>
</schema>
</CHContent>
</ctxContent>

Figure 3: XML for the concept hierarchy

5
Entertainment

Shopping
Games Comics & Television
Music Movies &
Animation Films
Credit Cards

SoundTracks VHS Actors


Education Animation
Less
General
Platoon O.S.T. Than
Stefano Accorsi Cartoons Artists

Warner Bros

Equivalent Compatible
Looney Tunes

Arts & Entertainment

Newspaper
Games Magazines Music Movies

SoundTracks IMAX
Console Cartridges DVD On Video In Theatre

PlayStation

Figure 5: Mapping between Entertainment and Arts and Entertainment

D E
w v
hCt , Et i, is a tuple of relations , , , , target contexts, and so on); and a content, namely the actual
mapping between the explicit representations of the source
each of which is a subset of Cs Ct .
and target contexts.
w
Intuitively c1 c2 means that c1 is more general than c2
v
(e.g., animal is more general than dog); c1 c2 means that
An example
v Figure 5 shows a graphical representation of a subset of the
c1 is less general than c2 ; c1 c2 means that c1 c2 and
w possible relations between concepts of Entertainment and
c1 c2 ; c1 c2 means that c1 is disjoint from c2 (e.g., concepts of Arts and Entertainment.

mountain is disjoint from sea), c1 c2 means that c1 is As contexts, mappings are represented as an xml file, in-
compatible with c2 (e.g., cars are compatible with hi-fi, as stance of a defined XML-Schema, and composed by two
there are hi-fi for cars). parts: the header and the set of relations. For the example
Distributed Description Logics (Borgida & Serafini 2002) of Figure 5 the header is the following, while the content of
provides a basic formal framework in which multiple con- the mapping is represented in Figure 6. Note that there are
cept hierarchies related mappings can be described. v
examples of (Less General Than), (Compatible),
The concrete representation of a mapping is an XML doc-
(Equivalent) relations between concepts. The definition
ument which structure is described by the another XML-
of all the other relation is similar except for the name.
schema. Similary to contexts, a concrete mapping is com-
posed of two parts: a header, which contains attributes of <mapHeader mapId="map01">
the mapping (e.g., an identifiers, the ids of the source and <owner>

6
<mapContent>
<CH2CHContent>
<schema xmlns="http://www.w3.org/2001/XMLSchema"
targetNamespace="http://.../Yahoo2Epinions"
xmlns:ctxSource="http://.../entertainmentYahoo"
xmlns:ctxTarget="http://.../artsEntertainmentEpinions">
<complexType name="lessGeneralThan">
<choice>
<element name="pair1">
<complexType>
<sequence>
<element type="ctxSource:Entertainment" name="source"/>
<element type="ctxTarget:artsEntertainment" name="target"/>
</sequence>
</complexType>
</element>
</choice>
</complexType>
<complexType name="equivalent">
<choice>
<element name="pair1">
<complexType>
<sequence>
<element type="ctxSource:Entertainment_Games" name="source"/>
<element type="ctxTarget:artsEntertainment_Games" name="target"/>
</sequence>
</complexType>
</element>
</choice>
</complexType>
<complexType name="compatible">
<choice>
<element name="pair1">
<complexType>
<sequence>
<element type="ctxSource:Entertainment_Television" name="source"/>
<element type="ctxTarget:artsEntertainment_Movies_OnVIdeo" name="target"/>
</sequence>
</complexType>
</element>
</choice>
</complexType>
</schema>
</CH2CHContent>
</mapContent>

Figure 6: XML for mapping

7
<agentId>Agent02</agentId> Drucker, P. 1994. Post Capitalist Society. Cambridge Uni-
</owner> versity Press.
<group> Ghidini, C., and Giunchiglia, F. 2001. Local models se-
<groupId>Entertainment</groupId> mantics, or contextual reasoning = locality + compatibility.
</group> 127(2):221259.
<security>
Johnson-Laird, P. N. 1983. Mental Models. Cambridge
<accessRights>modify</accessRights>
University Press.
<encription>none</encription>
</security> Kuhn, T. 1979. The structure of Scientific Revolutions.
<history> University of Chicago Press.
<version>v1.0</version>
<generatedFrom>
<operationType>scratch</operationType>
<elementUri/>
</generatedFrom>
<generatesList/>
</history>
<mapSource ctxId="cxt01" ctxVersion="v1.0"/>
<mapTarget ctxId="ctx02" ctxVersion="v1.0"/>
</mapHeader>

Conclusions
In this paper we described a theoretical framework for con-
texts and mapping for distributed knowledge management.
We also provide a concrete language that allow to specify
contexts and mappings. In a further work (also presented to
this workshop) we have outlined the main ideas of a linguis-
tic based algorithm for automatic discovering of mappings
between contexts.

References
Benerecetti, M.; Bouquet, P.; and Ghidini, C. 2000. Con-
textual Reasoning Distilled. Journal of Theoretical and Ex-
perimental Artificial Intelligence 12(3):279305.
Boland, J., and R.V.Tenkasi. 1995. Perspective making and
perspective taking in communities of knowing. Organiza-
tional Science 6(4):350372.
Bonifacio, M.; Bouquet, P.; and Traverso, P. 2002.
Enabling distributed knowledge management. manage-
rial and technological implications. Novatica and Infor-
matik/Informatique III(1).
Borgida, A., and Serafini, L. 2002. Distributed de-
scription logics. In Horrocks, I., and Tessaris, S.,
eds., Proceedings of the 2002 Intl. Workshop on De-
scription Logics (DL2002). Toulouse: CEUR-WS.
ONLINE: http://CEUR-WS.org/Vol-53/, ARCHIVE:
ftp://SunSITE.Informatik.RWTH-Aachen.DE/pub/
publications/CEUR-WS/Vol-53.tar.gz.
Buchner, A.; Ranta, M.; Hughes, J.; and Mantyla, M. 1999.
Semantic information mediation among multiple product
ontologies. In Proc. 4th World Conference on Integrated
Design & Process Technology.
Corcho, O., and Perez, A. G. A roadmap to ontology spec-
ification language. In Knowledge Engineering and Knowl-
edge Management. Methods, Models and Tools. Prooceed-
ings of the 12th International Conference, EKAW 2000,
8096.

Вам также может понравиться