Online from: 1977
Subject Area: Library and Information Studies
Options: To add Favourites and Table of Contents Alerts please take a Emerald profile
|Title:||A proposal for textual encoding based on semantic web tools|
|Author(s):||Giovanni Tummarello, (Digital Enterprise Research Institute, Galway, Ireland), Christian Morbidoni, (Semantic Web and Multimedia Group, Dipartimento di Elettronica, Intelligenza Artificiale e Telecomunicazioni, Università Politecnica delle Marche, Ancona, Italy), Paolo Puliti, (Semantic Web and Multimedia Group, Dipartimento di Elettronica, Intelligenza Artificiale e Telecomunicazioni, Università Politecnica delle Marche, Ancona, Italy), Francesco Piazza, (Semantic Web and Multimedia Group, Dipartimento di Elettronica, Intelligenza Artificiale e Telecomunicazioni, Università Politecnica delle Marche, Ancona, Italy)|
|Citation:||Giovanni Tummarello, Christian Morbidoni, Paolo Puliti, Francesco Piazza, (2008) "A proposal for textual encoding based on semantic web tools", Online Information Review, Vol. 32 Iss: 4, pp.467 - 477|
|Keywords:||Encoders, Extensible Markup Language, Makeup languages, Resource description framework, Semantics, Worldwide web|
|Article type:||Research paper|
|DOI:||10.1108/14684520810897340 (Permanent URL)|
|Publisher:||Emerald Group Publishing Limited|
|Acknowledgements:||The research presented in this paper was supported (in part) by the Lion Project and by the Science Foundation Ireland under Grant No. SFI/02/CE1/I131, and in part by the European project DISCOVERY No. ECP-2005-CULT-038206. Gratitude goes to Elena Pierazzo for assistance and for the overlapping TEI-encoded examples used in this paper, and to Fabio Kepler for the first TEI implementation.|
Purpose – The purpose of this paper is to investigate and prove the feasibility of a semantic web (SW) based approach to textual encoding. It aims to discuss benefits and novel possibilities with respect to traditional XML-based approaches.
Design/methodology/approach – The markup process can be seen as a task of knowledge representation where elements such as words, sentences and pages are instances of conceptual classes forming a semantic network. An ontology web language ontology for textual encoding has been developed, capturing structural and grammatical aspects. Different approaches and tools to query the encoded text are investigated.
Findings – resource description framework (RDF) is powerful and expressive enough to fulfil tasks traditionally done in XML as well as to enable new possibilities such as collaborative and distributed textual encoding and the use of ontology-based reasoning in text processing and querying. While the encoding of overlapping hierarchies through the use of existing approaches is often complex and leads to idiosyncratic solutions, this problem is naturally solved using SW languages.
Research limitations/implications – To make the approach suitable for widespread adoption, further work is required both in ontologies modelling and in applications (e.g. markup editing).
Practical implications – The prototype implementation imports existing encoded texts, transforms them into RDF-based markups and uses SW query languages to answer cross-hierarchy queries. Existing tools (reasoners, search and query engines, etc.) can be used immediately.
Originality/value – This methodology enables distributed interoperability and reuse of previous encoded results and opens the way to novel collaborative textual markup scenarios.
To purchase this item please login or register.
Complete and print this form to request this document from your librarian