Академический Документы
Профессиональный Документы
Культура Документы
I. I NTRODUCTION
Semantic annotation links content expressed in documents
to concepts in ontologies. Ontologies provide a specification of
concepts in a domain and how they relate to each other [1]. As
a result, semantic annotation facilitates unambiguous access
to document content. Document segment annotation marks-up
units within documents such as chapters and sections, with
representative concepts from domain ontologies. This enables
search models to target segments within documents [2] and to
link semantically related content [3]. Successful annotation of
documents have been shown to improve information retrieval
but are largely dependent on manual annotation which is
tedious, time-consuming and lacks scalability [4]. This paper
proposes the recommendation of ontology concepts to annotate
document segments based on the similarity between the content of a segment and context of concepts. Concepts whose
contexts are most similar to the segment are recommended
as annotation. Typically, domain-specific ontologies lack adequate textual contexts for this purpose. Therefore, with textual
labels of ontology concepts forming queries to search engines,
we augment concepts with contextual texts extracted from web
documents.
II. BACKGROUND AND R ELATED WORK
Electronic document authoring and mass digitisation efforts
have made vast amounts of domain-rich content available on
the web. To improve access to such content, several approaches
have been proposed in literature to semantically annotate
segments within documents. Document segments have been
annotated using texts in segment titles [2]. This approach
assumes that an author expresses the content of a segment in
the title which matches the labels of corresponding ontology
2 M SCS(x, y)
N (x) + N (y) + 2 M SCS(x, y)
(1)